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Introduction 



About this Specification 



History 



Scope 



This document describes TIFF, a tag-based file format for storing and interchang- 
ing raster images. 



The first version of the TIFF specification was published by Aldus Corporation in 
the fall of 1986, after a series of meetings with various scanner manufacturers and 
software developers. It did not have a revision number but should have been la- 
beled Revision 3.0 since there were two major earlier draft releases. 



Revision 4.0 contained mostly minor enhancements and was released in April 
1987. Revision 5.0, released in October 1988, added support for palette color 
images and LZW compression. 



TIFF describes image data that typically comes from scanners, frame grabbers, 
and paint- and photo-retouching programs. 

TIFF is not a printer language or page description language. The purpose of TIFF 
is to describe and store raster image data. 

A primary goal of TIFF is to provide a rich environment within which applica- 
tions can exchange image data. This richness is required to take advantage of the 
varying capabilities of scanners and other imaging devices. 

Though TIFF is a rich format, it can easily be used for simple scanners and appli- 
cations as well because the number of required fields is small. 

TIFF will be enhanced on a continuing basis as new imaging needs arise. A high 
priority has been given to structuring TIFF so that future enhancements can be 
added without causing unnecessary hardship to developers. 
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Features 



• TIFF is capable of describing bilevel, grayscale, palette-color, and full-color 
image data in several color spaces. 

• TIFF includes a number of compression schemes that allow developers to 
choose the best space or time tradeoff for their applications. 

• TIFF is not tied to specific scanners, printers, or computer display hardware. 

• TIFF is portable. It does not favor particular operating systems, file systems, 
compilers, or processors. 

• TIFF is designed to be extensible — to evolve gracefully as new needs arise. 

• TIFF allows the inclusion of an unlimited amount of private or special-purpose 
information. 
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Revision Notes 



Minor changes to TIFF 6.0, March 1995 



Updated contact information and TIFF administration policies, since Aldus Cor- 
poration merged with Adobe Systems Incorporated on September 1, 1994. 



The technical content and pagination are unchanged from the original June 3, 
1992 release. 



TIFF 5.0 to TIFF 6.0 



This revision replaces TIFF Revision 5.0. 

In the main body of the document, paragraphs that contain new or substantially - 
changed information are shown in italics. 



New Features In Revision 6.0 



Clarifications 



Major enhancements to TIFF 6.0 are described in Part 2. They include: 

• CMYK image definition 

• A revised RGB Colorimetry section. 

• YCbCr image definition 

• CIE L*a*b* image definition 

• Tiled image definition 

• JPEG compression 



• The LZW compression section more clearly explains when to switch the cod- 
ing bit length. 

• The interaction between Compression=2 (CCITT Huffman) and 
Photometriclnterpretation was clarified. 

• The data organization of uncompressed data (Compression= 1) when 
BitsPerSample is greater than 8 was clarified. See the Compression field de- 
scription. 

• The discussion of CCITT Group 3 and Group 4 bilevel image encodings was 
clarified and expanded, and Group30ptions and Group40ptions fields were 
renamed T40ptions and T60ptions. See Section 1 1. 
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Organizational Changes 



To make the organization more consistent and expandable, appendices were 
transformed into numbered sections. 

The document was divided into two parts — Baseline and Extensions — to help 
developers make better and more consistent implementation choices. Part 1, 
the Baseline section, describes those features that all general-purpose TIFF 
readers should support. Part 2, the Extensions section, describes a number of 
features that can be used by special or advanced applications. 

An index and table of contents were added. 



Changes in Requirements 



To illustrate a Baseline TIFF file earlier in the document, the material from 
Appendix G ("TIFF Classes") in Revision 5 was integrated into the main body 
of the specification . As part of this integration, the TIFF Classes terminology 
was replaced by the more monolithic Baseline TIFF terminology. The intent 
was to further encourage all mainstream TIFF readers to support the Baseline 
TIFF requirements for bilevel, grayscale, RGB, and palette-color images. 

Due to licensing issues, LZW compression support was moved out of the "Part 
1: Baseline TIFF" and into "Part 2: Extensions." 

Baseline TIFF requirements for bit depths in palette-color images were weak- 
ened a bit. 



Changes in Terminology 



In previous versions of the specification, the term "tag" reffered both to the identi- 
fying number of a TIFF field and to the entire field. In this version, the term "tag" 
refers only to the identifying number. The term "field" refers to the entire field, 
including the value. 



Compatibility 



Every attempt has been made to add functionality in such a way as to minimize 
compatibility problems with files and software that were based on earlier versions 
of the TIFF specification. The goal is that TIFF files should never become obso- 
lete and that TIFF software should not have to be revised more frequently than 
absolutely necessary. In particular, Baseline TIFF 6.0 files will generally be read- 
able even by older applications that assume TIFF 5.0 or an earlier version of the 
specification. 



However, TIFF 6.0 files that use one of the major new extensions, such as a new 
compression scheme or color space, will not be successfully read by older soft- 
ware. In such cases, the older applications must gracefully give up and refuse to 
import the image, providing the user with a reasonably informative message. 
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TIFF Administration 



Information and Support 



The most recent version of the TIFF specification is available in PDF format on 
the Adobe WWW and ftp servers See the cover page of the specification for the 
required addresses. 

Because of the widespread use of TIFF for in many environments, Adobe is un- 
able to provide a general consulting service for TIFF implementors. TIFF devel- 
opers are encouraged to study sample TIFF files, read TIFF documentation 
thoroughly, and work with developers of other products that are important to you. 

If your TIFF question specifically concerns compatibility with an Adobe Systems 
product, please contact Adobe Developer Supportatdevsup-person@adobe.com. 

Most companies that use TIFF can answer questions about support for TIFF in 
their products. Contact the appropriate product manager or developer support 
service group. 



Private Fields and Values 



An organization might wish to store information meaningful to only that organi- 
zation in a TIFF file. Tags numbered 32768 or higher, sometimes called private 
tags, are reserved for that purpose. 



Upon request, the TIFF administrator (send email to devsup-person@adobe.com) 
will allocate and register one or more private tags for an organization, to avoid 
possible conflicts with other organizations. You do not need to tell the TIFF ad- 
ministrator what you plan to use them for, but giving us this information may help 
other developers to avoid some duplication of effort. We will likely make the tag 
database public at some point. 

Private enumerated values can be accommodated in a similar fashion. For ex- 
ample, you may wish to experiment with a new compression scheme within TIFF. 
Enumeration constants numbered 32768 or higher are reserved for private usage. 
Upon request, the administrator will allocate and register one or more enumerated 
values for a particular field (Compression, in our example), to avoid possible 
conflicts. 

Tags and values allocated in the private number range are not prohibited from 
being included in a future revision of this specification. Several such instances 
exist in the current TIFF specification. 

Do not choose your own tag numbers. Doing so could cause serious compatibility 
problems in the future. However, if there is little or no chance that your TIFF files 
will escape your private environment, please consider using TIFF tags in the 
"reusable" 65000-65535 range. You do not need to contact Adobe when using 
numbers in this range. 
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If you need more than 10 tags, we suggest that you reserve a single private tag, 
define it as a LONG TIFF data type, and use its value as a pointer (offset) to a 
private IFD or other data structure of your choosing. Within that IFD, you can use 
whatever tags you want, since no one else will know that it is an IFD unless you 
tell them. 



Submitting a Proposal 



Any person or group that wants to propose a change or addition to the TIFF speci- 
fication should prepare a proposal that includes the following information: 

• Name of the person or group making the request, and your affiliation. 

• The reason for the request. 

• A list of changes exactly as you propose that they appear in the specification. 
Use inserts, callouts, or other obvious editorial techniques to indicate areas of 
change, and number each change. 

• Discussion of the potential impact on the installed base. 

• A list of contacts outside your company that support your position. Include 
their affiliation. 

Please send your proposal to devsup-person@adobe.com. 

The TIFF Advisory Committee 

The TIFF Advisory Committee is a working group of TIFF experts from a number 
of hardware and software manufacturers. It was formed in the spring of 199 1 to 
provide a forum for debating and refining proposals for the 6.0 release of the TIFF 
specification. 

If you are a TIFF expert and think you have the time and interest to work on this 
committee, contact devsup-person@adobe.com for further information. For the 
TIFF 6.0 release, the group met every two or three months, usually on the west 
coast of the U.S. Accessibility via Internet email is a requirement for membership, 
since that has proven to be an invaluable means for getting work done between 
meetings. 



Other TIFF Extensions 



The Aldus TIFF sections on CompuServe and AppleLink (new location is under 
construction; check the Adobe WWW home page (http://www.adobe.com) for 
future developements) will contain proposed TIFF extensions from other compa- 
nies that are not approved by Adobe as part of Baseline TIFF. 



These proposals typically represent specialized uses of TIFF that do not fall 
within the domain of publishing or general graphics or picture interchange. Gen- 
erally, these features will not be widely supported. If you do write files that incor- 
porate these extensions, be sure to either not call them TIFF files or mark them in 
some way so that they will not be confused with mainstream TIFF files. 
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If you have such a document, send it to devsup-person@adobe.com. All submis- 
sions must be PDF documents or simple text. Be sure to include contact informa- 
tion — at least an email address. 
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Part 1 : Baseline TIFF 



The TIFF specification is divided into two parts. Part 1 describes Baseline TIFF. 
Baseline TIFF is the core of TIFF, the essentials that all mainstream TIFF devel- 
opers should support in their products. 



11 



TIFF 6.0 Specification Final — June 3, 1992 



Section 1 : Notation 



Decimal and Hexadecimal 



Unless otherwise noted, all numeric values in this document are expressed in 
decimal. (".H" is appended to hexidecimal values.) 



Compliance 



Is and shall indicate mandatory requirements. All compliant writers and readers 
must meet the specification. 

Should indicates a recommendation. 

May indicates an option. 

Features designated 'not recommended for general data interchange ' are consid- 
ered extensions to Baseline TIFF. Files that use such features shall be designated 
"Extended TIFF 6.0" files, and the particular extensions used should be docu- 
mented. A Baseline TIFF 6.0 reader is not required to support any extensions. 
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Section 2: TIFF Structure 



TIFF is an image file format. In this document, a. file is defined to be a sequence of 
8-bit bytes, where the bytes are numbered from to N. The largest possible TIFF 
file is 2**32 bytes in length. 

A TIFF file begins with an 8-byte image file header that points to an image file 
directory (IFD). An image file directory contains information about the image, as 
well as pointers to the actual image data. 

The following paragraphs describe the image file header and IFD in more detail. 

See Figure 1 . 



Image File Header 



A TIFF file begins with an 8-byte image file header, containing the following 
information: 

Bytes 0- 1 : The byte order used within the file. Legal values are: 

"II" (4949 .H) 

"MM" (4D4D.H) 

In the "II" format, byte order is always from the least significant byte to the most 
significant byte, for both 16-bit and 32-bit integers This is called little-endian byte 
order. In the "MM" format, byte order is always from most significant to least 
significant, for both 16-bit and 32-bit integers. This is called big-endian byte 
order. 

Bytes 2-3 An arbitrary but carefully chosen number (42) that further identifies the file as a 
TIFF file. 

The byte order depends on the value of Bytes 0-1. 

Bytes 4-7 The offset (in bytes) of the first IFD. The directory may be at any location in the 
file after the header but must begin on a word boundary. In particular, an Image 
File Directory may follow the image data it describes. Readers must follow the 
pointers wherever they may lead. 

The term byte offset is always used in this document to refer to a location with 
respect to the beginning of the TIFF file. The first byte of the file has an offset of 
0. 
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Figure 1 
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Byte Order 



Offset of 0th IFD 



Number of Directory Entries 
Directory Entry 
Directory Entry 1 
Directory Entry 2 

Offset of next IFD 



Directory Entry 



Type 



Value or Offset 



V 



Image File Directory 



An Image File Directory (IFD) consists of a 2-by te count of the number of direc- 
tory entries (i.e., the number of fields), followed by a sequence of 12-byte field 
entries, followed by a 4-byte offset of the next IFD (or if none). (Do not forget to 
write the 4 bytes of after the last IFD.) 



There must be at least 1 IFD in a TIFF file and each IFD must have at least one 
entry. 

See Figure 1 . 

IFD Entry 

Each 12-byte IFD entry has the following format: 

Bytes 0-1 The Tag that identifies the field. 

Bytes 2-3 The field Type. 

Bytes 4-7 The number of values, Count of the indicated Type. 
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Bytes 8-11 The Value Offset, the file offset (in bytes) of the Value for the field. 
The Value is expected to begin on a word boundary; the correspond- 
ing Value Offset will thus be an even number. This file offset may 
point anywhere in the file, even after the image data. 



IFD Terminology 

A TIFF field is a logical entity consisting of TIFF tag and its value. This logical 
concept is implemented as an IFD Entry, plus the actual value if it doesn't fit into 
the value/offset part, the last 4 bytes of the IFD Entry. The terms TIFF field and 
IFD entry are interchangeable in most contexts. 



Sort Order 

The entries in an IFD must be sorted in ascending order by Tag. Note that this is 
not the order in which the fields are described in this document. The Values to 
which directory entries point need not be in any particular order in the file. 



Value/Offset 

To save time and space the Value Offset contains the Value instead of pointing to 
the Value if and only if the Value fits into 4 bytes . If the Value is shorter than 4 
bytes, it is left-justified within the 4-byte Value Offset, i.e., stored in the lower- 
numbered bytes. Whether the Value fits within 4 bytes is determined by the Type 
and Count of the field. 



Count 

Count — called Length in previous versions of the specification — is the number of 
values. Note that Count is not the total number of bytes. For example, a single 16- 
bit word (SHORT) has a Count of 1 ; not 2. 



Types 

The field types and their sizes are: 

1 = BYTE 8-bit unsigned integer. 

2 = ASCII 8-bit byte that contains a 7-bit ASCII code; the last byte 

must be NUL (binary zero). 

3 = SHORT 16-bit (2 -byte) unsigned integer. 

4 = LONG 32-bit (4-byte) unsigned integer. 

5 = RATIONAL Two LONGs: the first represents the numerator of a 

fraction; the second, the denominator. 

The value of the Count part of an ASCII field entry includes the NUL. If padding 
is necessary, the Count does not include the pad byte. Note that there is no initial 
"count byte" as in Pascal-style strings. 



15 



TIFF 6.0 Specification Final — June 3, 1992 



Any ASCII field can contain multiple strings, each terminated with a NUL. A 
single string is preferred whenever possible. The Count for multi-string fields is 
the number of bytes in all the strings in that field plus their terminating NUL 
bytes. Only one NUL is allowed between strings, so that the strings following the 
first string will often begin on an odd byte. 

The reader must check the type to verify that it contains an expected value. TIFF 
currently allows more than 1 valid type for some fields. For example, ImageWidth 
and ImageLength are usually specified as having type SHORT. But images with 
more than 64K rows or columns must use the LONG field type. 

TIFF readers should accept BYTE, SHORT, or LONG values for any unsigned 
integer field. This allows a single procedure to retrieve any integer value, makes 
reading more robust, and saves disk space in some situations. 

In TIFF 6.0, some new field types have been defined: 

6 = SBYTE An 8-bit signed (twos-complement) integer. 

7 = UNDEFINED An 8-bit byte that may contain anything, depending on 

the definition of the field. 

8 = SSHORT A 16-bit (2 -byte) signed (twos-complement) integer. 

9 = SLONG A 32-bit (4-byte) signed (twos-complement) integer. 

10 = SRATIONAL Two SLONG's: the first represents the numerator of a 

fraction, the second the denominator. 

11= FLOAT Single precision (4-byte) IEEE format. 

12 = DOUBLE Double precision (8-byte) IEEE format. 

These new field types are also governed by the byte order (II or MM) in the TIFF 
header. 

Warning: It is possible that other TIFF field types will be added in the future. 
Readers should skip over fields containing an unexpected field type. 



Fields are arrays 

Each TIFF field has an associated Count. This means that all fields are actually 
one- dimensional arrays, even though most fields contain only a single value. 

For example, to store a complicated data structure in a single private field, use 
the UNDEFINED field type and set the Count to the number of bytes required to 
hold the data structure. 



Multiple Images per TIFF File 



There may be more than one IFD in a TIFF file. Each IFD defines a subfile. One 
potential use of subfiles is to describe related images, such as the pages of a fac- 
simile transmission. A Baseline TIFF reader is not required to read any IFDs 
beyond the first one. 
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Section 3: Bilevel Images 



Color 



Now that the overall TIFF structure has been described, we can move on to filling 
the structure with actual fields (tags and values) that describe raster image data. 

To make all of this clearer, the discussion will be organized according to the four 
Baseline TIFF image types: bilevel, grayscale, palette-color, and full-color im- 
ages. This section describes bilevel images. 

Fields required to describe bilevel images are introduced and described briefly 
here. Full descriptions of each field can be found in Section 8. 



A bilevel image contains two colors — black and white. TIFF allows an applica- 
tion to write out bilevel data in either a white-is-zero or black-is-zero format. The 
field that records this information is called Photometriclnterpretation. 



1 = 



Photometriclnterpretation 



Tag =262(106.H) 

Type = SHORT 

Values: 

WhitelsZero. For bilevel and grayscale images: is imaged as white. The maxi- 
mum value is imaged as black. This is the normal value for Compression=2. 

BlacklsZero. For bilevel and grayscale images: is imaged as black. The maxi- 
mum value is imaged as white. If this value is specified for Compression=2, the 
image should display and print reversed. 



Compression 



Data can be stored either compressed or uncompressed. 

Compression 

Tag =259 (103.H) 
Type = SHORT 
Values: 

1 = No compression, but pack data into bytes as tightly as possible, leaving no unused 

bits (except at the end of a row). The component values are stored as an array of 
type BYTE. Each scan line (row) is padded to the next BYTE boundary. 

2 = CCITT Group 3 1-Dimensional Modified Huffman run length encoding. See 
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Section 10 for a description of Modified Huffman Compression. 

32773 = PackBits compression, a simple byte-oriented ran length scheme. See the 
PackBits section for details. 

Data compression applies only to raster image data. All other TIFF fields are 
unaffected. 

Baseline TIFF readers must handle all three compression schemes. 

Rows and Columns 

An image is organized as a rectangular array of pixels. The dimensions of this 
array are stored in the following fields: 

ImageLength 

Tag =257 (101.H) 
Type = SHORT or LONG 

The number of rows (sometimes described as scanlines) in the image. 

ImageWidth 

Tag =256 (100.H) 
Type = SHORT or LONG 

The number of columns in the image, i.e., the number of pixels per scanline. 



Physical Dimensions 



Applications often want to know the size of the picture represented by an image. 
This information can be calculated from ImageWidth and ImageLength given the 
following resolution data: 



ResolutionUnit 

Tag = 296 (128.H) 
Type = SHORT 
Values: 

1 = No absolute unit of measurement. Used for images that may have a non-square 

aspect ratio but no meaningful absolute dimensions. 

2 = Inch. 

3 = Centimeter. 

Default =2 (inch). 
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XResolution 

Tag =282 (11A.H) 

Type = RATIONAL 

The number of pixels per ResolutionUnit in the ImageWidth (typically, horizontal 
- see Orientation) direction. 

YResolution 

Tag =283 (11B.H) 

Type = RATIONAL 

The number of pixels per ResolutionUnit in the ImageLength (typically, vertical) 
direction. 

Location of the Data 



Compressed or uncompressed image data can be stored almost anywhere in a 
TIFF file. TIFF also supports breaking an image into separate strips for increased 
editing flexibility and efficient I/O buffering. The location and size of each strip is 
given by the following fields: 



RowsPerStrip 

Tag =278 (116.H) 
Type = SHORT or LONG 

The number of rows in each strip (except possibly the last strip.) 

For example, if ImageLength is 24, and RowsPerStrip is 10, then there are 3 
strips, with 10 rows in the first strip, 10 rows in the second strip, and 4 rows in the 
third strip. (The data in the last strip is not padded with 6 extra rows of dummy 
data.) 

StripOffsets 

Tag =273 (lll.H) 
Type = SHORT or LONG 

For each strip, the byte offset of that strip. 

StripByteCounts 

Tag =279 (117.H) 
Type = SHORT or LONG 

For each strip, the number of bytes in that strip after any compression. 



19 



TIFF 6.0 Specification 



Final — June 3, 1992 



Putting it all together (along with a couple of less-important fields that are dis- 
cussed later), a sample bilevel image file might contain the following fields: 



A Sample Bilevel TIFF File 



Offset 


Description 


(hex) 




Header 




0000 


Byte Order 


0002 


42 


0004 


1st IFD offset 


IFD: 




0014 


Number of Directory Entries 


0016 


NewSubfileType 


0022 


ImageWidth 


002E 


ImageLength 


003A 


Compression 


0046 


Photometriclnterpretation 


0052 


StripOffsets 


005E 


RowsPerStrip 


006A 


StripByteCounts 


0076 


XResolution 


0082 


YResolution 


008E 


Software 


009A 


DateTime 


00A6 


Next IFD offset 


Values longer than 4 bytes: 


00B6 


StripOffsets 


03A6 


StripByteCounts 


0696 


XResolution 


069E 


YResolution 


06A6 


Software 


06B6 


DateTime 


Image Data: 


00000700 


xxxxxxxx 


xxxxxxxx 


xxxxxxxx 


End of example 



Value 

(numeric values are expressed in hexadecimal notation) 



002A 








00000014 








oooc 








00FE 


0004 


00000001 


00000000 


0100 


0004 


00000001 


000007D0 


0101 


0004 


00000001 


00000BB8 


0103 


0003 


00000001 


8005 0000 


0106 


0003 


00000001 


0001 0000 


0111 


0004 


000000BC 000000B6 


0116 


0004 


00000001 


00000010 


0117 


0003 


000000BC 000003 A6 


011A 


0005 


00000001 


00000696 


011B 


0005 


00000001 


0000069E 


0131 


0002 


0000000E 


000006A6 


0132 


0002 


00000014 


000006B6 


00000000 









OffsetO, Offset 1, ... Offset 187 
CountO, Countl, ... Countl87 
00000 12C 00000001 
0000012C 00000001 
"PageMaker 4.0" 
"1988:02:18 13:59:59" 

Compressed data for strip 10 
Compressed data for strip 179 
Compressed data for strip 53 
Compressed data for strip 160 



20 



TIFF 6.0 Specification Final — June 3, 1992 



Comments on the Bi level Image Example 

• The IFD in this example starts at 14h. It could have started anywhere in the file 
providing the offset was an even number greater than or equal to 8 (since the 
TIFF header is always the first 8 bytes of a TIFF file). 

• With 16 rows per strip, there are 188 strips in all. 

• The example uses a number of optional fields such as DateTime. TIFF readers 
must safely skip over these fields if they do not understand or do not wish to 
use the information. Baseline TIFF readers must not require that such fields be 
present. 

• To make a point, this example has highly-fragmented image data. The strips of 
the image are not in sequential order. The point of this example is to illustrate 
that strip offsets must not be ignored. Never assume that strip N+l follows 
strip N on disk. It is not required that the image data follow the IFD informa- 
tion. 



Required Fields for Bi level Images 



Here is a list of required fields for Baseline TIFF bilevel images. The fields are 
listed in numerical order, as they would appear in the IFD. Note that the previous 
example omits some of these fields. This is permitted because the fields that were 
omitted each have a default and the default is appropriate for this file. 



TagName 


Decimal 


Hex 


Type 


Value 


ImageWidth 


256 


100 


SHORT or LONG 




ImageLength 


257 


101 


SHORT or LONG 




Compression 


259 


103 


SHORT 


1,2 or 32773 


Photometriclnterpretation 


262 


106 


SHORT 


Oor 1 


StripOffsets 


273 


111 


SHORT or LONG 




RowsPerStrip 


278 


116 


SHORT or LONG 




StripByteCounts 


279 


117 


LONG or SHORT 




XResolution 


282 


11A 


RATIONAL 




YResolution 


283 


11B 


RATIONAL 




ResolutionUnit 


296 


128 


SHORT 


l,2or3 



Baseline TIFF bilevel images were called TIFF Class B images in earlier versions 
of the TIFF specification. 



21 



TIFF 6.0 Specification 



Final — June 3, 1992 



Section 4: Grayscale Images 



Grayscale images are a generalization of bilevel images. Bilevel images can store 
only black and white image data, but grayscale images can also store shades of 
gray. 

To describe such images, you must add or change the following fields. The other 
required fields are the same as those required for bilevel images. 



Differences from Bilevel Images 



Compression = 1 or 32773 (PackBits). In Baseline TIFF, grayscale images can 
either be stored as uncompressed data or compressed with the PackBits algorithm. 

Caution: PackBits is often ineffective on continuous tone images, including many 
grayscale images. In such cases, it is better to leave the image uncompressed. 



BitsPerSample 

Tag =258 (102.H) 

Type = SHORT 

The number of bits per component. 

Allowable values for Baseline TIFF grayscale images are 4 and 8, allowing either 
16 or 256 distinct shades of gray. 



Required Fields for Grayscale Images 



These are the required fields for grayscale images (in numerical order): 
TagName Decimal Hex Type Value 



ImageWidth 


256 


100 


SHORT or LONG 




ImageLength 


257 


101 


SHORT or LONG 




BitsPerSample 


258 


102 


SHORT 


4 or 8 


Compression 


259 


103 


SHORT 


lor 32773 


Photometriclnterpretation 


262 


106 


SHORT 


Oorl 


StripOffsets 


273 


111 


SHORT or LONG 




RowsPerStrip 


278 


116 


SHORT or LONG 




StripByteCounts 


279 


117 


LONG or SHORT 




XResolution 


282 


11A 


RATIONAL 




YResolution 


283 


11B 


RATIONAL 




ResolutionUnit 


296 


128 


SHORT 


1 or 2 or 3 



Baseline TIFF grayscale images were called TIFF Class G images in earlier ver- 
sions of the TIFF specification. 
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Section 5: Palette-color Images 



Palette-color images are similar to grayscale images. They still have one compo- 
nent per pixel, but the component value is used as an index into a full RGB-lookup 
table. To describe such images, you need to add or change the following fields. 
The other required fields are the same as those for grayscale images. 



Differences from Grayscale Images 



Photometriclnterpretation = 3 (Palette Color). 

ColorMap 

Tag =320(140.H) 

Type = SHORT 

N = 3 * (2**BitsPerSample) 

This field defines a Red-Green-Blue color map (often called a lookup table) for 
palette color images. In a palette-color image, a pixel value is used to index into an 
RGB-lookup table. For example, a palette-color pixel having a value of would 
be displayed according to the 0th Red, Green, Blue triplet. 

In a TIFF ColorMap, all the Red values come first, followed by the Green values, 
then the Blue values. In the ColorMap, black is represented by 0,0,0 and white is 
represented by 65535, 65535, 65535. 



Required Fields for Palette Color Images 



These are the required fields for palette-color images (in numerical order): 
TagName Decimal Hex Type Value 



ImageWidth 


256 


100 


SHORT or LONG 




ImageLength 


257 


101 


SHORT or LONG 




BitsPerSample 


258 


102 


SHORT 


4 or 8 


Compression 


259 


103 


SHORT 


lor 32773 


Photometriclnterpretation 


262 


106 


SHORT 


3 


StripOffsets 


273 


111 


SHORT or LONG 




RowsPerStrip 


278 


116 


SHORT or LONG 




StripByteCounts 


279 


117 


LONG or SHORT 




XResolution 


282 


11A 


RATIONAL 




YResolution 


283 


11B 


RATIONAL 




ResolutionUnit 


296 


128 


SHORT 


1 or 2 or 3 


ColorMap 


320 


140 


SHORT 





Baseline TIFF palette-color images were called TIFF Class P images in earlier 
versions of the TIFF specification. 
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Section 6: RGB Full Color Images 



In an RGB image, each pixel is made up of three components: red, green, and 
blue. There is no ColorMap. 

To describe an RGB image, you need to add or change the following fields and 
values. The other required fields are the same as those required for palette-color 
images. 



Differences from Palette Color Images 



BitsPerSample = 8,8,8. Each component is 8 bits deep in a Baseline TIFF RGB 
image. 

Photometriclnterpretation = 2 (RGB). 

There is no ColorMap. 



SamplesPerPixel 



Tag 



;277 (115.H) 



Type = SHORT 

The number of components per pixel. This number is 3 for RGB images, unless 
extra samples are present. See the ExtraSamples field for further information. 



Required Fields for RGB Images 



These are the required fields for RGB 
TagName Decimal 



images (in numerical order): 

Hex Type Value 



ImageWidth 


256 


100 


SHORT or LONG 




ImageLength 


257 


101 


SHORT or LONG 




BitsPerSample 


258 


102 


SHORT 


8,8,8 


Compression 


259 


103 


SHORT 


lor 32773 


Photometriclnterpretation 


262 


106 


SHORT 


2 


StripOffsets 


273 


111 


SHORT or LONG 




SamplesPerPixel 


277 


115 


SHORT 


3 or more 


RowsPerStrip 


278 


116 


SHORT or LONG 




StripByteCounts 


279 


117 


LONG or SHORT 




XResolution 


282 


11A 


RATIONAL 




YResolution 


283 


11B 


RATIONAL 




ResolutionUnit 


296 


128 


SHORT 


l,2or3 
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The BitsPerSample values listed above apply only to the main image data. If 
ExtraSamples are present, the appropriate BitsPerSample values for those 
samples must also be included. 



Baseline TIFF RGB images were called TIFF Class R images in earlier versions 
of the TIFF specification. 
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Section 7: Additional Baseline TIFF 
Requirements 

This section describes characteristics required of all Baseline TIFF files. 

General Requirements 

Options. Where there are options, TIFF writers can use whichever they want. 
Baseline TIFF readers must be able to handle all of them. 

Defaults. TIFF writers may, but are not required to, write out a field that has a 
default value, if the default value is the one desired. TIFF readers must be pre- 
pared to handle either situation. 

Other fields. TIFF readers must be prepared to encounter fields other than those 
required in TIFF files. TIFF writers are allowed to write optional fields such as 
Make, Model, and DateTime, and TIFF readers may use such fields if they exist. 
TIFF readers must not, however, refuse to read the file if such optional fields do 
not exist. TIFF readers must also be prepared to encounter and ignore private 
fields not described in the TIFF specification. 

'MM' and 'II' byte order. TIFF readers must be able to handle both byte orders. 
TIFF writers can do whichever is most convenient or efficient. 

Multiple subfiles. TIFF readers must be prepared for multiple images (subfiles) 
per TIFF file, although they are not required to do anything with images after the 
first one. TIFF writers are required to write a long word of after the last IFD (to 
signal that this is the last IFD), as described earlier in this specification. 

If multiple subfiles are written, the first one must be the full-resolution image. 
Subsequent images, such as reduced-resolution images, may be in any order in the 
TIFF file. If a reader wants to use such images, it must scan the corresponding 
IFD's before deciding how to proceed. 

TIFF Editors. Editors — applications that modify TIFF files — have a few addi- 
tional requirements: 

• TIFF editors must be especially careful about subfiles. If a TIFF editor edits a 
full-resolution subfile, but does not update an accompanying reduced-resolu- 
tion subfile, a reader that uses the reduced-resolution subfile for screen display 
will display the wrong thing. So TIFF editors must either create a new reduced- 
resolution subfile when they alter a full-resolution subfile or they must delete 
any subfiles that they aren't prepared to deal with. 

• A similar situation arises with the fields in an IFD. It is unnecessary — and 
possibly dangerous — for an editor to copy fields it does not understand be- 
cause the editor might alter the file in a way that is incompatible with the un- 
known fields. 

No Duplicate Pointers. No data should be referenced from more than one place. 
TIFF readers and editors are under no obligation to detect this condition and 
handle it properly. This would not be a problem if TIFF files were read-only enti- 



26 



TIFF 6.0 Specification Final — June 3, 1992 



ties, but they are not. This warning covers both TIFF field value offsets and fields 
that are defined as offsets, such as StripOffsets. 

Point to real data. All strip offsets must reference valid locations. (It is not legal to 
use an offset ofO to mean something special.) 

Beware of extra components. Some TIFF files may have more components per 
pixel than you think. A Baseline TIFF reader must skip over them gracefully, 
using the values of the Samples Per Pixel and BitsPerSample fields. For example, 
it is possible that the data will have a Photometriclnterpretation of RGB but have 
4 SamplesPerPixel. See ExtraSamples for further details. 

Beware of new field types. Be prepared to handle unexpected field types such as 
floating-point data. A Baseline TIFF reader must skip over such fields gracefully. 
Do not expect that BYTE, ASCII, SHORT, LONG, and RATIONAL will always be 
a complete list of field types. 

Beware of new pixel types. Some TIFF files may have pixel data that consists of 
something other than unsigned integers. If the SampleFormat field is present and 
the value is not 1, a Baseline TIFF reader that cannot handle the SampleFormat 
value must terminate the import process gracefully. 



Notes on Required Fields 



Image Width, ImageLength. Both "SHORT" and "LONG" TIFF field types are 
allowed and must be handled properly by readers. TIFF writers can use either 
type. TIFF readers are not required to read arbitrarily large files however. Some 
readers will give up if the entire image cannot fit into available memory. (In such 
cases the reader should inform the user about the problem.) Others will probably 
not be able to handle ImageWidth greater than 65535. 



RowsPerStrip. SHORT or LONG. Readers must be able to handle any value 
between 1 and 2**32-1. However, some readers may try to read an entire strip 
into memory at one time. If the entire image is one strip, the application may run 
out of memory. Recommendation: Set RowsPerStrip such that the size of each 
strip is about 8K bytes. Do this even for uncompressed data because it is easy for 
a writer and makes things simpler for readers. Note that extremely wide high- 
resolution images may have rows larger than 8K bytes; in this case, RowsPerStrip 
should be 1, and the strip will be larger than 8K. 

StripOffsets. SHORT or LONG. 

StripByteCounts. SHORT or LONG. 

XResolution, YResolution. RATIONAL. Note that the X and Y resolutions may 
be unequal. A TIFF reader must be able to handle this case. Typically, TIFF pixel- 
editors do not care about the resolution, but applications (such as page layout 
programs) do care. 

ResolutionUnit. SHORT. TIFF readers must be prepared to handle all three 
values for ResolutionUnit. 
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Section 8: Baseline Field Reference Guide 



The Fields 



This section contains detailed information about all the Baseline fields defined in 
this version of TIFF. A Baseline field is any field commonly found in a Baseline 
TIFF file, whether required or not. 

For convenience, fields that were defined in earlier versions of the TIFF specifica- 
tion but are no longer generally recommended have also been included in this 
section. 

New fields that are associated with optional features are not listed in this section. 
See Part 2 for descriptions of these new fields. There is a complete list of all fields 
described in this specification in Appendix A, and there are entries for all TIFF 
fields in the index. 

More fields may be added in future versions. Whenever possible they will be 
added in a way that allows old TIFF readers to read newer TIFF files. 

The documentation for each field contains: 

• the name of the field 

• the Tag number 

• the field Type 

• the required Number of Values (N); i.e., the Count 

• comments describing the field 

• the default, if any 

If the field does not exist, readers must assume the default value for the field. 

Most of the fields described in this part of the document are not required or are 
required only for particular types of TIFF files. See the preceding sections for lists 
of required fields. 

Before defining the fields, you must understand these basic concepts: A Baseline 
TIFF image is defined to be a two-dimensional array of pixels, each of which 
consists of one or more color components. Monochromatic data has one color 
component per pixel, while RGB color data has three color components per pixel. 



Artist 

Person who created the image. 

Tag =315 (13B.H) 

Type = ASCII 

Note: some older TIFF files used this tag for storing Copyright information. 
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BitsPerSample 

Number of bits per component. 

Tag =258 (102.H) 

Type = SHORT 

N = SamplesPerPixel 

Note that this field allows a different number of bits per component for each 
component corresponding to a pixel. For example, RGB color data could use a 
different number of bits per component for each of the three color planes. Most RGB 
files will have the same number of BitsPerSample for each component. Even in this 
case, the writer must write all three values. 

Default = 1 . See also SamplesPerPixel. 

CellLength 

The length of the dithering or halftoning matrix used to create a dithered or 
halftoned bilevel file. 

Tag =265 (109.H) 

Type = SHORT 

N =1 

This field should only be present if Threshholding = 2 

No default. See also Threshholding. 

CellWidth 

The width of the dithering or halftoning matrix used to create a dithered or 
halftoned bilevel file.Tag =264 (108.H) 

Type = SHORT 

N =1 

No default. See also Threshholding. 

ColorMap 

A color map for palette color images. 

Tag =320(140.H) 

Type = SHORT 

N = 3 * (2**BitsPerSample) 

This field defines a Red-Green-Blue color map (often called a lookup table) for 
palette-color images. In a palette-color image, a pixel value is used to index into 
an RGB lookup table. For example, a palette-color pixel having a value of 
would be displayed according to the 0th Red, Green, Blue triplet. 
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In a TIFF ColorMap, all the Red values come first, followed by the Green values, 
then the Blue values. The number of values for each color is 2**BitsPerSample. 
Therefore, the ColorMap field for an 8-bit palette-color image would have 3 * 256 
values. 

The width of each value is 16 bits, as implied by the type of SHORT. represents 
the minimum intensity, and 65535 represents the maximum intensity. Black is 
represented by 0,0,0, and white by 65535, 65535, 65535. 

See also Photometriclnterpretation — palette color. 

No default. ColorMap must be included in all palette-color images. 

Compression 

Compression scheme used on the image data. 
Tag =259 (103.H) 
Type = SHORT 

N =1 

1 = No compression, but pack data into bytes as tightly as possible leaving no unused 

bits except at the end of a row. 

If Then the sample values are stored as an array of type: 

BitsPerSample = 16 for all samples SHORT 

BitsPerSample = 32 for all samples LONG 

Otherwise BYTE 

Each row is padded to the next BYTE/SHORT/LONG boundary, consistent with 
the preceding BitsPerSample rule. 

If the image data is stored as an array of SHORTs or LONGs, the byte ordering 

must be consistent with that specified in bytes and 1 of the TIFF file header. 

Therefore, little-endian format files will have the least significant bytes preceding 

the most significant bytes, while big-endian format files will have the opposite 

order. 

If the number of bits per component is not a power of 2, and you are willing to give up 

some space for better performance, use the next higher power of 2. For example, if 

your data can be represented in 6 bits, set BitsPerSample to 8 instead of 6, and then 

convert the range of the values from [0,63] to [0,255]. 

Rows must begin on byte boundaries. (SHORT boundaries if the data is stored as 
SHORTs, LONG boundaries if the data is stored as LONGs). 

Some graphics systems require image data rows to be word-aligned or double-word- 
aligned, and padded to word-boundaries or double-word boundaries. Uncompressed 
TIFF rows will need to be copied into word-aligned or double-word-aligned row 
buffers before being passed to the graphics routines in these environments. 

2 = CCITT Group 3 1-Dimensional Modified Huffman run-length encoding. See 

Section 10. BitsPerSample must be 1, since this type of compression is defined 
only for bilevel images. 
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32773 = PackBits compression, a simple byte-oriented run-length scheme. See Section 9 
for details. 

Data compression applies only to the image data, pointed to by StripOffsets. 

Default = 1. 

Copyright 

Copyright notice. 

Tag =33432 (8298.H) 

Type = ASCII 

Copyright notice of the person or organization that claims the copyright to the 
image. The complete copyright statement should be listed in this field including 
any dates and statements of claims. For example, "Copyright, John Smith, 19xx. 
All rights reserved." 

DateTime 

Date and time of image creation. 
Tag =306 (132.H) 
Type = ASCII 

N =20 

The format is: "YYYY:MM:DD HH:MM:SS", with hours like those on a 24-hour 
clock, and one space character between the date and the time. The length of the 
string, including the terminating NUL, is 20 bytes. 

ExtraSamples 

Description of extra components. 

Tag =338(152.11) 

Type = SHORT 

N =m 

Specifies that each pixel has m extra components whose interpretation is defined 
by one of the values listed below. When this field is used, the SamplesPerPixel 
field has a value greater than the Photometriclnterpretation field suggests. 

For example, full-color RGB data normally has SamplesPerPixel=3. If 
SamplesPerPixel is greater than 3, then the ExtraSamples field describes the 
meaning of the extra samples. If SamplesPerPixel is, say, 5 then ExtraSamples 
will contain 2 values, one for each extra sample. 

ExtraSamples is typically used to include non-color information, such as opacity, 
in an image. The possible values for each item in the field's value are: 

= Unspecified data 

1 = Associated alpha data (with pre-multiplied color) 
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Unassociated alpha data 



Associated alpha data is opacity information; it is fully described in Section 21. 
Unassociated alpha data is transparency information that logically exists indepen- 
dent of an image; it is commonly called a soft matte. Note that including both 
unassociated and associated alpha is undefined because associated alpha specifies 
that color components are pre-multiplied by the alpha component, while 
unassociated alpha specifies the opposite. 

By convention, extra components that are present must be stored as the "last com- 
ponents" in each pixel. For example, if SamplesPerPixel is 4 and there is 1 extra 
component, then it is located in the last component location (SamplesPerPixel- 1) 
in each pixel. 

Components designated as "extra" are just like other components in a pixel. In 
particular, the size of such components is defined by the value of the 
BitsPerSample field. 

With the introduction of this field, TIFF readers must not assume a particular 
SamplesPerPixel value based on the value of the Photometriclnterpretation field. 
For example, if the file is an RGB file, SamplesPerPixel may be greater than 3. 

The default is no extra samples. This field must be present if there are extra 
samples. 

See also SamplesPerPixel, AssociatedAlpha. 



FillOrder 

The logical order of bits within a byte. 
Tag =266 (10A.H) 
Type = SHORT 
N =1 

1 = pixels are arranged within a byte such that pixels with lower column values are 

stored in the higher-order bits of the byte. 

1-bit uncompressed data example: Pixel of a row is stored in the high-order bit 
of byte 0, pixel 1 is stored in the next-highest bit, ..., pixel 7 is stored in the low- 
order bit of byte 0, pixel 8 is stored in the high-order bit of byte 1, and so on. 

CCITT 1 -bit compressed data example: The high-order bit of the first compres- 
sion code is stored in the high-order bit of byte 0, the next-highest bit of the first 
compression code is stored in the next-highest bit of byte 0, and so on. 

2 = pixels are arranged within a byte such that pixels with lower column values are 

stored in the lower-order bits of the byte. 

We recommend that FillOrder=2 be used only in special-purpose applications. It 
is easy and inexpensive for writers to reverse bit order by using a 256-byte lookup 
table. FillOrder = 2 should be used only when BitsPerSample = 1 and the data is 
either uncompressed or compressed using CCITT ID or 2D compression, to 
avoid potentially ambigous situations. 

Support for FillOrder=2 is not required in a Baseline TIFF compliant reader 

Default is FillOrder = 1 . 
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FreeByteCounts 

For each string of contiguous unused bytes in a TIFF file, the number of bytes in 
the string. 

Tag =289 (121.H) 

Type = LONG 

Not recommended for general interchange. 

See also FreeOffsets. 



FreeOffsets 

For each string of contiguous unused bytes in a TIFF file, the byte offset of the 
string. 

Tag =288 (120.H) 

Type = LONG 

Not recommended for general interchange. 

See also FreeByteCounts. 

GrayResponseCurve 

For grayscale data, the optical density of each possible pixel value. 

Tag =291(123.H) 

Type = SHORT 

N =2**BitsPerSample 

The 0th value of GrayResponseCurve corresponds to the optical density of a pixel 
having a value of 0, and so on. 

This field may provide useful information for sophisticated applications, but it is 
currently ignored by most TIFF readers. 

See also GrayResponseUnit, Photometriclnterpretation. 

GrayResponseUnit 

The precision of the information contained in the GrayResponseCurve. 
Tag =290(122.H) 
Type = SHORT 

N =1 

Because optical density is specified in terms of fractional numbers, this field is 
necessary to interpret the stored integer information. For example, if 
GrayScaleResponseUnits is set to 4 (ten-thousandths of a unit), and a 
GrayScaleResponseCurve number for gray level 4 is 3455, then the resulting 
actual value is 0.3455. 

Optical densitometers typically measure densities within the range of 0.0 to 2.0. 
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1 = Number represents tenths of a unit. 

2 = Number represents hundredths of a unit. 

3 = Number represents thousandths of a unit. 

4 = Number represents ten-thousandths of a unit. 

5 = Number represents hundred-thousandths of a unit. 

Modifies GrayResponseCurve. 

See also GrayResponseCurve. 

For historical reasons, the default is 2. However, for greater accuracy, 3 is recom- 
mended. 

HostComputer 

The computer and/or operating system in use at the time of image creation. 

Tag =316 (13C.H) 

Type = ASCII 

See also Make, Model, Software. 

ImageDescription 

A string that describes the subject of the image. 

Tag = 270 (10E.H) 

Type = ASCII 

For example, a user may wish to attach a comment such as "1988 company pic- 
nic" to an image. 

ImageLength 

The number of rows of pixels in the image. 

Tag =257 (101.H) 

Type = SHORT or LONG 

N =1 

No default. See also ImageWidth. 

ImageWidth 

The number of columns in the image, i.e., the number of pixels per row. 

Tag =256 (100.H) 

Type = SHORT or LONG 

N =1 

No default. See also ImageLength. 
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Make 

The scanner manufacturer. 

Tag =271 (10F.H) 

Type = ASCII 

Manufacturer of the scanner, video digitizer, or other type of equipment used to 
generate the image. Synthetic images should not include this field. 

See also Model, Software. 

MaxSample Value 

The maximum component value used. 

Tag =281 (119.H) 

Type = SHORT 

N = SamplesPerPixel 

This field is not to be used to affect the visual appearance of an image when it is 
displayed or printed. Nor should this field affect the interpretation of any other 
field; it is used only for statistical purposes. 

Default is 2**(BitsPerSample) - 1. 

MinSample Value 

The minimum component value used. 

Tag =280 (118.H) 

Type = SHORT 

N = SamplesPerPixel 

See also MaxSampleValue. 

Default is 0. 

Model 

The scanner model name or number. 

Tag =272 (110.H) 

Type = ASCII 

The model name or number of the scanner, video digitizer, or other type of equip- 
ment used to generate the image. 

See also Make, Software. 
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NewSubfileType 

A general indication of the kind of data contained in this subfile. 

Tag = 254 (FE.H) 

Type = LONG 

N=l 

Replaces the old SubfileType field, due to limitations in the definition of that field. 

NewSubfileType is mainly useful when there are multiple subfiles in a single 
TIFF file. 

This field is made up of a set of 32 flag bits. Unused bits are expected to be 0. Bit 
is the low-order bit. 

Currently defined values are: 

Bit is 1 if the image is a reduced-resolution version of another image in this TIFF file; 
else the bit is 0. 

Bit 1 is 1 if the image is a single page of a multi-page image (see the PageNumber field 
description); else the bit is 0. 

Bit 2 is 1 if the image defines a transparency mask for another image in this TIFF file. 
The Photometriclnterpretation value must be 4, designating a transparency mask. 

These values are defined as bit flags because they are independent of each other. 

Default is 0. 

Orientation 

The orientation of the image with respect to the rows and columns. 
Tag =274(112.H) 
Type = SHORT 

N =1 

1 = The 0th row represents the visual top of the image, and the 0th column represents 

the visual left-hand side. 

2 = The 0th row represents the visual top of the image, and the 0th column represents 

the visual right-hand side. 

3 = The 0th row represents the visual bottom of the image, and the 0th column repre- 

sents the visual right-hand side. 

4 = The 0th row represents the visual bottom of the image, and the 0th column repre- 

sents the visual left-hand side. 

5 = The 0th row represents the visual left-hand side of the image, and the 0th column 

represents the visual top. 

6 = The 0th row represents the visual right-hand side of the image, and the 0th column 

represents the visual top. 

7 = The 0th row represents the visual right-hand side of the image, and the 0th column 

represents the visual bottom. 



36 



TIFF 6.0 Specification Final — June 3, 1992 



8 = The 0th row represents the visual left-hand side of the image, and the 0th column 
represents the visual bottom. 

Default is 1 . 

Support for orientations other than 1 is not a Baseline TIFF requirement. 

Photometriclnterpretation 

The color space of the image data. 
Tag =262(106.H) 
Type = SHORT 

N =1 

0= WhitelsZero. For bilevel and grayscale images: is imaged as white. 

2**BitsPerSample-l is imaged as black. This is the normal value for Compres- 
sion=2. 

1 = BlacklsZero. For bilevel and grayscale images: is imaged as black. 

2**BitsPerSample-l is imaged as white. If this value is specified for Compres- 
sion=2, the image should display and print reversed. 

2 = RGB. In the RGB model, a color is described as a combination of the three pri- 
mary colors of light (red, green, and blue) in particular concentrations. For each of 
the three components, represents minimum intensity, and 2**BitsPerSample - 1 
represents maximum intensity. Thus an RGB value of (0,0,0) represents black, 
and (255,255,255) represents white, assuming 8-bit components. For 
PlanarConfiguration = 1, the components are stored in the indicated order: first 
Red, then Green, then Blue. For PlanarConfiguration = 2, the StripOffsets for the 
component planes are stored in the indicated order: first the Red component plane 
StripOffsets, then the Green plane StripOffsets, then the Blue plane StripOffsets. 

3= Palette color. In this model, a color is described with a single component. The 
value of the component is used as an index into the red, green and blue curves in 
the ColorMap field to retrieve an RGB triplet that defines the color. When 
PhotometricInterpretation=3 is used, ColorMap must be present and 
Samp lesPerPixel must be 1. 

4 = Transparency Mask. 

This means that the image is used to define an irregularly shaped region of another 
image in the same TIFF file. SamplesPerPixel and BitsPerSample must be 1. 
PackBits compression is recommended. The 1-bits define the interior of the re- 
gion; the 0-bits define the exterior of the region. 

A reader application can use the mask to determine which parts of the image to 
display. Main image pixels that correspond to 1 -bits in the transparency mask are 
imaged to the screen or printer, but main image pixels that correspond to 0-bits in 
the mask are not displayed or printed. 

The image mask is typically at a higher resolution than the main image, if the 
main image is grayscale or color so that the edges can be sharp. 

There is no default for Photometriclnterpretation, and it is required. Do not rely 
on applications defaulting to what you want. 
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PlanarConfiguration 

How the components of each pixel are stored. 
Tag =284(11C.H) 
Type = SHORT 

N =1 

1 = Chunky format. The component values for each pixel are stored contiguously. 
The order of the components within the pixel is specified by 
Photometriclnterpretation. For example, for RGB data, the data is stored as 
RGBRGBRGB... 

2 = Planar format. The components are stored in separate "component planes." The 
values in StripOffsets and StripByteCounts are then arranged as a 2-dimensional 
array, with SamplesPerPixel rows and StripsPerlmage columns. (All of the col- 
umns for row are stored first, followed by the columns of row 1, and so on.) 
Photometriclnterpretation describes the type of data stored in each component 
plane. For example, RGB data is stored with the Red components in one compo- 
nent plane, the Green in another, and the Blue in another. 

PlanarConfiguration=2 is not currently in widespread use and it is not recom- 
mended for general interchange. It is used as an extension and Baseline TIFF 
readers are not required to support it. 

If SamplesPerPixel is 1, PlanarConfiguration is irrelevant, and need not be in- 
cluded. 

If a row interleave effect is desired, a writer might write out the data as 
PlanarConfiguration=2 — separate sample planes — but break up the planes into 
multiple strips (one row per strip, perhaps) and interleave the strips. 

Default is 1 . See also BitsPerSample, SamplesPerPixel. 

ResolutionUnit 

The unit of measurement for XResolution and YResolution. 

Tag =296(128.H) 

Type = SHORT 

N =1 

To be used with XResolution and YResolution. 

1 = No absolute unit of measurement. Used for images that may have a non-square 

aspect ratio, but no meaningful absolute dimensions. 

The drawback of ResolutionUnit=l is that different applications will import the image 
at different sizes. Even if the decision is arbitrary, it might be better to use dots per 
inch or dots per centimeter, and to pick XResolution and YResolution so that the 
aspect ratio is correct and the maximum dimension of the image is about four inches 
(the "four" is arbitrary.) 

2 = Inch. 

3 = Centimeter. 

Default is 2. 
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RowsPerStrip 

The number of rows per strip. 
Tag =278 (116.H) 
Type = SHORT or LONG 

N =1 

TIFF image data is organized into strips for faster random access and efficient I/O 

buffering. 

RowsPerStrip and ImageLength together tell us the number of strips in the entire 

image. The equation is: 

StripsPerlmage = floor ((ImageLength + RowsPerStrip - 1) / RowsPerStrip). 

StripsPerlmage is not a field. It is merely a value that a TIFF reader will want to 
compute because it specifies the number of StripOffsets and StripByteCounts for the 
image. 

Note that either SHORT or LONG values can be used to specify RowsPerStrip. 
SHORT values may be used for small TIFF files. It should be noted, however, that 
earlier TIFF specification revisions required LONG values and that some software 
may not accept SHORT values. 

The default is 2**32 - 1, which is effectively infinity. That is, the entire image is 

one strip. 

Use of a single strip is not recommended. Choose RowsPerStrip such that each strip is 

about 8K bytes, even if the data is not compressed, since it makes buffering simpler 

for readers. The "8K" value is fairly arbitrary, but seems to work well. 

See also ImageLength, StripOffsets, StripByteCounts, TileWidth, TileLength, 
TileOffsets, TileByteCounts. 

SamplesPerPixel 

The number of components per pixel. 
Tag =277 (115.H) 
Type = SHORT 

N =1 

SamplesPerPixel is usually 1 for bilevel, grayscale, and palette-color images. 
SamplesPerPixel is usually 3 for RGB images. 

Default = 1. See also BitsPerSample, Photometriclnterpretation, ExtraSamples. 

Software 

Name and version number of the software package(s) used to create the image. 

Tag =305 (131.H) 

Type = ASCII 

See also Make, Model. 
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StripByteCounts 

For each strip, the number of bytes in the strip after compression. 

Tag =279 (117.H) 

Type = SHORT or LONG 

N = StripsPerlmage for PlanarConfiguration equal to 1. 

= SamplesPerPixel * StripsPerlmage for PlanarConfiguration equal to 2 
This tag is required for Baseline TIFF files. 
No default. 
See also StripOffsets, RowsPerStrip, TileOffsets, TileByteCounts. 

StripOffsets 

For each strip, the byte offset of that strip. 

Tag =273 (lll.H) 

Type = SHORT or LONG 

N = StripsPerlmage for PlanarConfiguration equal to 1. 

= SamplesPerPixel * StripsPerlmage for PlanarConfiguration equal to 2 

The offset is specified with respect to the beginning of the TIFF file. Note that this 
implies that each strip has a location independent of the locations of other strips. 
This feature may be useful for editing applications. This required field is the only 
way for a reader to find the image data. (Unless TileOffsets is used; see 
TileOffsets.) 

Note that either SHORT or LONG values may be used to specify the strip offsets. 
SHORT values may be used for small TIFF files. It should be noted, however, that 
earlier TIFF specifications required LONG strip offsets and that some software 
may not accept SHORT values. 

For maximum compatibility with operating systems such as MS-DOS and Win- 
dows, the StripOffsets array should be less than or equal to 64K bytes in length, 
and the strips themselves, in both compressed and uncompressed forms, should 
not be larger than 64K bytes. 

No default. See also StripByteCounts, RowsPerStrip, TileOffsets, 
TileByteCounts. 

SubfileType 

A general indication of the kind of data contained in this subfile. 
Tag =255 (FF.H) 
Type = SHORT 

N =1 
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Currently defined values are: 

1 = full-resolution image data 

2 = reduced-resolution image data 

3 = a single page of a multi-page image (see the PageNumber field description). 

Note that several image types may be found in a single TIFF file, with each subfile 
described by its own IFD. 

No default. 

This field is deprecated. The NewSubfileType field should be used instead. 

Threshholding 

For black and white TIFF files that represent shades of gray, the technique used to 
convert from gray to black and white pixels. 

Tag =263 (107.H) 

Type = SHORT 

N =1 

1 = No dithering or halftoning has been applied to the image data. 

2 = An ordered dither or halftone technique has been applied to the image data. 

3 = A randomized process such as error diffusion has been applied to the image data. 

Default is Threshholding = 1. See also CellWidth, CellLength. 

XResolution 

The number of pixels per ResolutionUnit in the ImageWidth direction. 
Tag =282 (11A.H) 
Type = RATIONAL 

N =1 

It is not mandatory that the image be actually displayed or printed at the size implied 
by this parameter. It is up to the application to use this information as it wishes. 

No default. See also YResolution, ResolutionUnit. 

YResolution 

The number of pixels per ResolutionUnit in the ImageLength direction. 

Tag =283 (11B.H) 

Type = RATIONAL 

N =1 

No default. See also XResolution, ResolutionUnit. 
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Section 9: PackBits Compression 



This section describes TIFF compression type 32773, a simple byte-oriented run- 
length scheme. 



Description 



In choosing a simple byte-oriented run-length compression scheme, we arbitrarily 
chose the Apple Macintosh PackBits scheme. It has a good worst case behavior 
(at most 1 extra byte for every 128 input bytes). For Macintosh users, the toolbox 
utilities PackBits and UnPackBits will do the work for you, but it is easy to imple- 
ment your own routines. 

A pseudo code fragment to unpack might look like this: 

Loop until you get the number of unpacked bytes you are expecting: 

Read the next source byte into n. 

If n is between and 127 inclusive, copy the next n+1 bytes literally. 

Else if n is between -127 and -1 inclusive, copy the next byte -n+1 

times. 

Else if n is -128, noop. 
Endloop 



In the inverse routine, it is best to encode a 2-by te repeat run as a replicate run 
except when preceded and followed by a literal run. In that case, it is best to merge 
the three runs into one literal run. Always encode 3-byte repeats as replicate runs. 

That is the essence of the algorithm. Here are some additional rules: 

• Pack each row separately. Do not compress across row boundaries. 

• The number of uncompressed bytes per row is defined to be (ImageWidth + 7) 
/ 8. If the uncompressed bitmap is required to have an even number of bytes per 
row, decompress into word-aligned buffers. 

• If a run is larger than 128 bytes, encode the remainder of the run as one or more 
additional replicate runs. 

When PackBits data is decompressed, the result should be interpreted as per com- 
pression type 1 (no compression). 
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Section 10: Modified Huffman Compression 



References 



This section describes TIFF compression scheme 2, a method for compressing 
bilevel data based on the CCITT Group 3 ID facsimile compression scheme. 



• "Standardization of Group 3 facsimile apparatus for document transmission," 
Recommendation T.4, Volume VII, Fascicle VII.3, Terminal Equipment and 
Protocols for Telematic Services, The International Telegraph and Telephone 
Consultative Committee (CCITT), Geneva, 1985, pages 16 through 3 1 . 

• "Facsimile Coding Schemes and Coding Control Functions for Group 4 Fac- 
simile Apparatus," Recommendation T.6, Volume VII, Fascicle VII.3, Termi- 
nal Equipment and Protocols for Telematic Services, The International 
Telegraph and Telephone Consultative Committee (CCITT), Geneva, 1985, 
pages 40 through 48. 

We do not believe that these documents are necessary in order to implement Com- 
pression=2. We have included (verbatim in most places) all the pertinent informa- 
tion in this section. However, if you wish to order the documents, you can write to 
ANSI, Attention: Sales, 1430 Broadway, New York, N.Y., 10018. Ask for the 
publication listed above — it contains both Recommendation T.4 and T.6. 



Relationship to the CCITT Specifications 



The CCITT Group 3 and Group 4 specifications describe communications proto- 
cols for a particular class of devices. They are not by themselves sufficient to 
describe a disk data format. Fortunately, however, the CCITT coding schemes can 
be readily adapted to this different environment. The following is one such adap- 
tation. Most of the language is copied directly from the CCITT specifications. 



See Section 11 for additional CCITT compression options. 



Coding Scheme 



A line (row) of data is composed of a series of variable length code words. Each 
code word represents a run length of all white or all black. (Actually, more than 
one code word may be required to code a given run, in a manner described below.) 
White runs and black runs alternate. 

To ensure that the receiver (decompressor) maintains color synchronization, all 
data lines begin with a white run-length code word set. If the actual scan line 
begins with a black run, a white run-length of zero is sent (written). Black or white 
run-lengths are defined by the code words in Tables 1 and 2. The code words are 
of two types: Terminating code words and Make-up code words. Each run-length 
is represented by zero or more Make-up code words followed by exactly one 
Terminating code word. 
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Run lengths in the range of to 63 pels (pixels) are encoded with their appropriate 
Terminating code word. Note that there is a different list of code words for black 
and white run-lengths. 

Run lengths in the range of 64 to 2623 (2560+63) pels are encoded first by the 
Make-up code word representing the run-length that is nearest to, not longer than, 
that required. This is then followed by the Terminating code word representing 
the difference between the required run-length and the run-length represented by 
the Make-up code. 

Run lengths in the range of lengths longer than or equal to 2624 pels are coded 
first by the Make-up code of 2560. If the remaining part of the run (after the first 
Make-up code of 2560) is 2560 pels or greater, additional Make-up code(s) of 
2560 are issued until the remaining part of the run becomes less than 2560 pels. 
Then the remaining part of the run is encoded by Terminating code or by Make-up 
code plus Terminating code, according to the range mentioned above. 

It is considered an unrecoverable error if the sum of the run-lengths for a line does 
not equal the value of the ImageWidth field. 

New rows always begin on the next available byte boundary. 

No EOL code words are used. No fill bits are used, except for the ignored bits at 
the end of the last byte of a row. RTC is not used. 

An encoded CCITT string is self- photometric, defined in terms of white and black 
runs. Yet TIFF defines a tag called Photometriclnterpretation that also purports 
to define what is white and what is black. Somewhat arbitrarily, we adopt the 
following convention: 

The "normal" Photometriclnterpretation for bilevel CCITT compressed data is 
WhitelsZero. In this case, the CCITT "white " runs are to be interpretated as 
white, and the CCITT "black" runs are to be interpreted as black. However, if the 
Photometriclnterpretation is BlacklsZero, the TIFF reader must reverse the 
meaning of white and black when displaying and printing the image. 
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Table 1/T.4 Terminating codes 



White 

run 

length 



Code 
word 



Black 

run Code 
length word 





: 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 



00110101 

000111 

0111 

1000 

1011 

1100 

1110 

1111 

10011 

10100 

00111 

01000 

001000 

000011 

110100 

110101 

101010 

101011 

0100111 

0001100 

0001000 

0010111 

0000011 

0000100 

0101000 

0101011 

0010011 

0100100 

0011000 

00000010 

00000011 

00011010 

00011011 

00010010 

00010011 

00010100 

00010101 

00010110 

00010111 

00101000 

00101001 

00101010 

00101011 

00101100 

00101101 

00000100 

00000101 

00001010 

00001011 

01010010 

01010011 

01010100 





: 

2 

3 
4 
5 
6 
7 
8 
9 
10 

11 

12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 



000011 

010 

11 

10 

011 

0011 

0010 

00011 

000101 

000100 

000010 

000010 

000011 

000001 

000001 

000011 

000001 

000001 

000000 

000011 

000011 

000011 

000001 

000001 

000000 

000000 

000011 

011 

000011 

000011 

000001 

000001 

000001 

000001 

000011 

000011 

000011 

000011 

000011 

000011 

000001 

000001 

000011 

000011 

000001 

000001 

000001 

000001 

000001 

000001 

000001 

000001 



0111 





1 
1 

00 

11 

000 

0111 

1000 

1000 

00111 

01000 

01100 

10111 

01000 

10111 

11000 

001010 

001011 

001100 

001101 

101000 

101001 

101010 

101011 

010010 

010011 

010100 

010101 

010110 

010111 

101100 

101101 

011010 

011011 

010100 

010101 

010110 

010111 

100100 

100101 

010010 

010011 
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White 

run 

length 



Code 
word 



Black 

run Code 
length word 



52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 



01010101 
00100100 
00100101 
01011000 
01011001 
01011010 
01011011 
01001010 
01001011 
00110010 
00110011 
00110100 



52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 



000000 
000000 
000000 
000000 
000000 
000001 
000001 
000000 
000000 
000001 
000001 
000001 



100100 
110111 
111000 
100111 
101000 
011000 
011001 
101011 
101100 
011010 
100110 
100111 



Table 2/T.4 Make-up codes 



White 

run 

length 



Code 
word 



Black 

run Code 
length word 



64 

128 

192 

256 

320 

384 

448 

512 

576 

640 

704 

768 

832 

896 

960 

1024 

1088 

1152 

1216 

1280 

1344 

1408 

1472 

1536 

1600 

1664 

1728 

EOL 



11011 
10010 
01011 
01101 
00110 
00110 
01100 
01100 
01101 
01100 
01100 
01100 
01101 
01101 
01101 
01101 
01101 
01101 
01101 
01101 
01101 
01101 
01001 
01001 
01001 
01100 
01001 
00000 



1 
11 

110 

111 

100 
101 
000 

111 

1100 

1101 

0010 

0011 

0100 

0101 

0110 

0111 

1000 

1001 

1010 

1011 

1000 

1001 

1010 



1011 

0000001 



64 

128 

192 

256 

320 

384 

448 

512 

576 

640 

704 

768 

832 

896 

960 

1024 

1088 

1152 

1216 

1280 

1344 

1408 

1472 

1536 

1600 

1664 

1728 

EOL 



000000 
000011 
000011 
01 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 



1111 

001000 

001001 

011011 

110011 

110100 

110101 

1101100 

1101101 

1001010 

1001011 

1001100 

1001101 

1110010 

1110011 

1110100 

1110101 

1110110 

1110111 

1010010 

1010011 

1010100 

1010101 

1011010 

1011011 

1100100 

1100101 

00000 
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Additional make-up codes 



White 

and 

Black Make-up 

run code 

length word 



1792 00000001000 

1856 00000001100 

1920 00000001101 

1984 000000010010 

2048 000000010011 

2112 000000010100 

2176 000000010101 

2240 000000010110 

2304 000000010111 

2368 000000011100 

2432 000000011101 

2496 000000011110 

2560 000000011111 
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Part 2: TIFF Extensions 



Part 2 contains extensions to Baseline TIFF. TIFF Extensions are TIFF features 
that may not be supported by all TIFF readers. TIFF creators who use these fea- 
tures will have to work closely with TIFF readers in their part of the industry to 
ensure successful interchange. 

The features described in this part were either contained in earlier versions of the 
specification, or have been approved by the TIFF Advisory Committee. 
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Section 11: CCITT Bilevel Encodings 



The following fields are used when storing binary pixel arrays using one of the 
encodings adopted for raster-graphic interchange in numerous CCITT and ISO 
(International Organization for Standards) recommendations and standards. These 
encodings are often spoken of as "Group III compression" and "Group IV com- 
pression" because their application in facsimile transmission is the most widely 
known. 

For the specialized use of these encodings in storing facsimile-transmission images, 
further guidelines can be obtained from the TIFF Class F document, available on-line 
in the same locations as this specification. This document is administered by another 
organization; paper copies are not available from Adobe. 



Compression 

Tag =259 (103.H) 
Type = SHORT 

N =1 

T4-encoding: CCITT T.4 bi-level encoding as specified in section 4, Coding, of 
CCITT Recommendation T.4: "Standardization of Group 3 Facsimile apparatus 
for document transmission." International Telephone and Telegraph Consultative 
Committee (CCITT, Geneva: 1988). 

See the T40ptions field for T4-encoding options such as ID vs 2D coding. 

T6-encoding: CCITT T.6 bi-level encoding as specified in section 2 of CCITT 
Recommendation T.6: "Facsimile coding schemes and coding control functions 
for Group 4 facsimile apparatus." International Telephone and Telegraph Consul- 
tative Committee (CCITT, Geneva: 1988). 

See the T60ptions field for T6-encoding options such as escape into 
uncompressed mode to avoid negative-compression cases. 



Application in Image Interchange 

CCITT Recommendations T.4 and T.6 are specified in terms of the serial bit-by- 
bit creation and processing of a variable-length binary string that encodes bi-level 
(black and white) pixels of a rectangular image array. Generally, the encoding 
schemes are described in terms of bit-serial communication procedures and the 
end-to-end coordination that is required to gain reliable delivery over inherently 
unreliable data links. The Group 4 procedures, with their T6-encoding, represent a 
significant simplification because it is assumed that a reliable communication 
medium is employed, whether ISDN or X.25 or some other trustworthy transport 
vehicle. Because image-storage systems and computers achieve data integrity and 
communication reliability in other ways, the T6-encoding tends to be prefered for 
imaging applications. When computer storage and retrieval and interchange of 
facsimile material are of interest, the T4-encodings provide a better match to the 
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current generation of Group 3 facsimile products and their defenses against data 
corruption as the result of transmission defects. 

Whichever form of encoding is preferable for a given application, there are a 
number of adjustments that need to be made to account for the capture of the 
CCITT binary-encoding strings as part of electronically-stored material and digi- 
tal-image interchange. 

Photometriclnterpretation. An encoded CCITT string is self-photometric, de- 
fined in terms of white and black runs. Yet TIFF defines a tag called 
Photometriclnterpretation that also purports to define what is white and what is 
black. Somewhat arbitrarily, we adopt the following convention: 

The "normal" Photometriclnterpretation for bilevel CCITT compressed data is 
WhitelsZero. In this case, the CCITT "white" runs are to be interpretated as white, 
and the CCITT "black" runs are to be interpreted as black. However, if the 
Photometriclnterpretation is BlacklsZero, the TIFF reader must reverse the mean- 
ing of white and black when displaying and printing the image. 

FillOrder. When CCITT encodings are used directly over a typical serial commu- 
nication link, the order of the bits in the encoded string is the sequential order of 
the string, bit-by-bit, from beginning to end. This poses the following question: In 
which order should consecutive blocks of eight bits be assembled into octets 
(standard data bytes) for use within a computer system? The answer differs de- 
pending on whether we are concerned about preserving the serial -transmission 
sequence or preserving only the format of byte-organized sequences in memory 
and in stored files. 

From the perspective of electronic interchange, as long as a receiver's reassembly 
of bits into bytes properly mirrors the way in which the bytes were disassembled 
by the transmitter, no one cares which order is seen on the transmission link be- 
cause each multiple of 8 bits is transparently transmitted. 

Common practice is to record arbitrary binary strings into storage sequences such 
that the first sequential bit of the string is found in the high-order bit of the first 
octet of the stored byte sequence. This is the standard case specified by TIFF 
FillOrder = 1, used in most bitmap interchange and the only case required in 
Baseline TIFF. This is also the approach used for the octets of standard 8-bit char- 
acter data, with little attention paid to the fact that the most common forms of data 
communication transmit and reassemble individual 8-bit frames with the low- 
order-bit first! 

For bit-serial transmission to a distant unit whose approach to assembling bits into 
bytes is unknown and supposed to be irrelevant, it is necessary to satisfy the ex- 
pected sequencing of bits over the transmission link. This is the normal case for 
communication between facsimile units and also for computers and modems 
emulating standard Group 3 facsimile units. In this case, if the CCITT encoding is 
captured directly off of the link via standard communication adapters, TIFF 
FillOrder = 2 will usually apply to that stored data form. 

Consequently, different TIFF FillOrder cases may arise when CCITT encodings 
are obtained by synthesis within a computer (including Group 4 transmission, 
which is treated more like computer data) instead of by capture from a Group 3 
facsimile unit. 

Because this is such a subtle situation, with surprisingly disruptive consequences 
for FillOrder mismatches, the following practice is urged whenever CCITT bi- 
level encodings are used: 
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a. TIFF FillOrder (tag 266) should always be explicitly specified. 

b. FillOrder = 1 should be employed wherever possible in persistent material 
that is intended for interchange. This is the only reliable case for widespread 
interchange among computer systems, and it is important to explicitly con- 
firm the honoring of standard assumptions. 

c. FillOrder = 2 should occur only in highly-localized and preferably-transient 
material, as in a facsimile server supporting group 3 facsimile equipment. 
The tag should be present as a safeguard against the CCITT encoding "leak- 
ing" into an unsuspecting application, allowing readers to detect and warn 
against the occurence. 

There are interchange situations where fill order is not distinguished, as when 
filtering the CCITT encoding into a PostScript level 2 image operation. In this 
case, as in most other cases of computer-based information interchange, 
FillOrder=l is assumed, and any padding to a multiple of 8 bits is accomplished 
by adding a sufficient number of 0-bits to the end of the sequence. 

Strips and Tiles. When CCITT bi-level encoding is employed, interaction with 
stripping (Section 3) and tiling (Section 15) is as follows: 

a. Decompose the image into segments — individual pixel arrays representing 
the desired strip or tile configuration. The CCITT encoding procedures are 
applied most flexibly if the segments each have a multiple of 4 lines. 

b. Individually encode each segment according to the specified CCITT bi- 
level encoding, as if each segment is a separate raster-graphic image. 

The reason for this general rule is that CCITT bi-level encodings are generally 
progressive. That is, the initial line of pixels is encoded, and then subsequent lines, 
according to a variety of options, are encoded in terms of changes that need to be 
made to the preceding (unencoded) line. For strips and tiles to be individually 
usable, they must each start as fresh, independent encodings. 

Miscellaneous features. There are provisions in CCITT encoding that are mostly 
meaningful during facsimile-transmission procedures. There is generally no sig- 
nificant application when storing images in TIFF or other data interchange for- 
mats, although TIFF applications should be tolerant and flexible in this regard. 
These features tend to have significance only when facilitating transfer between 
facsimile and non-facsimile applications of the encoded raster-graphic images. 
Further considerations for fill sequences, end-of-line flags, return-to-control (end- 
of-block) sequences and byte padding are introduced in discussion of the indi- 
vidual encoding options. 



T40ptions 

Tag =292(124.H) 
Type = LONG 

N =1 

See Compression=3. This field is made up of a set of 32 flag bits. Unused bits 
must be set to 0. Bit is the low-order bit. 

Bit is 1 for 2-dimensional coding (otherwise 1 -dimensional is assumed). For 
2-D coding, if more than one strip is specified, each strip must begin with a 1- 
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dimensionally coded line. That is, RowsPerStrip should be a multiple of "Param- 
eter K," as documented in the CCITT specification. 

Bit 1 is 1 if uncompressed mode is used. 

Bit 2 is 1 if fill bits have been added as necessary before EOL codes such that 
EOL always ends on a byte boundary, thus ensuring an EOL-sequence of 1 byte 
preceded by a zero nibble: xxxx-0000 0000-0001. 

Default is 0, for basic 1-dimensional coding. See also Compression. 



T60ptions 

Tag = 293 (125.H) 
Type = LONG 

N =1 

See Compression = 4. This field is made up of a set of 32 flag bits. Unused bits 
must be set to 0. Bit is the low-order bit. The default value is (all bits 0). 

bitO is unused and always 0. 

bit 1 is 1 if uncompressed mode is allowed in the encoding. 

In earlier versions of TIFF, this tag was named Group40ptions. The significance 
has not changed and the present definition is compatible. The name of the tag has 
been changed to be consistent with the nomenclature of other T.6-encoding appli- 
cations. 

Readers should honor this option tag, and only this option tag, whenever T.6- 
Encoding is specified for Compression. 

For T.6-Encoding, each segment (strip or tile) is encoded as if it were a separate 
image. The encoded string from each segment starts a fresh byte. 

There are no one-dimensional line encodings in T.6-Encoding. Instead, even the 
first row of the segment' s pixel array is encoded two-dimensionally by always 
assuming an invisible preceding row of all-white pixels. The 2-dimensional pro- 
cedure for encoding the body of individual rows is the same as that used for 2- 
dimensional T.4-encoding and is described fully in the CCITT specifications. 

The beginning of the encoding for each row of a strip or tile is conducted as if 
there is an imaginary preceding (0-width) white pixel, that is as if a fresh run of 
white pixels has just commenced. The completion of each line is encoded as if 
there are imaginary pixels beyond the end of the current line, and of the preceding 
line, in effect, of colors chosen such that the line is exactly completable by a code 
word, making the imaginary next pixel a changing element that's not actually 
used. 

The encodings of successive lines follow contiguously in the binary T.6-Encoding 
stream with no special initiation or separation codewords. There are no provisions 
for fill codes or explicit end-of-line indicators. The encoding of the last line of the 
pixel array is followed immediately, in place of any additional line encodings, by 
a 24-bit End-of-Facsimile Block (EOFB). 

00000000000 100000000000 1 .B . 
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The EOFB sequence is immediately followed by enough 0-bit padding to fit the 
entire stream into a sequence of 8-bit bytes. 

General Application. Because of the single uniform encoding procedure, without 
disruptions by end-of-line codes and shifts into one-dimensional encodings, T.6- 
encoding is very popular for compression of bi-level images in document imaging 
systems. T.6-encoding trades off redundancy for minimum encoded size, relying 
on the underlying storage and transmission systems for reliable retention and 
communication of the encoded stream. 

TIFF readers will operate most smoothly by always ignoring bits beyond the 
EOFB. Some writers may produce additional bytes of pad bits beyond the byte 
containing the final bit of the EOFB. Robust readers will not be disturbed by this 
prospect. 

It is not possible to correctly decode aT.6-Encoding without knowledge of the 
exact number of pixels in each line of the pixel array. ImageWidth (or TileWidth, 
if used) must be stated exactly and accurately. If an image or segment is 
overscanned, producing extraneous pixels at the beginning or ending of lines, 
these pixels must be counted. Any cropping must be accomplished by other 
means. It is not possible to recover from a pixel-count deviation, even when one is 
detected. Failure of any row to be completed as expected is cause for abandoning 
further decoding of the entire segment. There is no requirement that ImageWidth 
be a multiple of eight, of course, and readers must be prepared to pad the final 
octet bytes of decoded bitmap rows with additional bits. 

If a TIFF reader encounters EOFB before the expected number of lines has been 
extracted, it is appropriate to assume that the missing rows consist entirely of 
white pixels. Cautious readers might produce an unobtrusive warning if such an 
EOFB is followed by anything other than pad bits. 

Readers that successfully decode the RowsPerStrip (or TileLength or residual 
ImageLength) number of lines are not required to verify that an EOFB follows. 
That is, it is generally appropriate to stop decoding when the expected lines are 
decoded or the EOFB is detected, whichever occurs first. Whether error indica- 
tions or warnings are also appropriate depends upon the application and whether 
more precise troubleshooting of encoding deviations is important. 

TIFF writers should always encode the full, prescribed number of rows, with a 
proper EOFB immediately following in the encoding. Padding should be by the 
least number of 0-bits needed for the T.6-encoding to exactly occupy a multiple of 
8 bits. Only 0-bits should be used for padding, and StripByteCount (or 
TileByteCount) should not extend to any bytes not containing properly-formed 
T.6-encoding. In addition, even though not required by T.6-encoding rules, suc- 
cessful interchange with a large variety of readers and applications will be en- 
hanced if writers can arrange for the number of pixels per line and the number of 
lines per strip to be multiples of eight. 

Uncompressed Mode. Although T.6-encodings of simple bi-level images result 
in data compressions of 10: 1 and better, some pixel-array patterns have T.6- 
encodings that require more bits than their simple bi-level bitmaps. When such 
cases are detected by encoding procedures, there is an optional extension for 
shifting to a form of uncompressed coding within the T.6-encoding string. 

Uncompressed mode is not well-specified and many applications discourage its 
usage, prefering alternatives such as different compressions on a segment-by- 
segment (strip or tile) basis, or by simply leaving the image uncompressed in its 
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entirety. The main complication for readers is in properly restoring T.6-encoding 
after the uncompressed sequence is laid down in the current row. 

Readers that have no provision for uncompressed mode will generally reject any 
case in which the flag is set. Readers that are able to process uncompressed-mode 
content within T.6-encoding strings can safely ignore this flag and simply process 
any uncompressed-mode occurences correctly. 

Writers that are unable to guarantee the absence of uncompressed-mode material 
in any of the T.6-encoded segments must set the flag. The flag should be cleared 
(or defaulted) only when absence of uncompressed-mode material is assured. 
Writers that are able to inhibit the generation of uncompressed-mode extensions 
are encouraged to do so in order to maximize the acceptability of their T.6-encod- 
ing strings in interchange situations. 

Because uncompressed-mode is not commonly used, the following description is 
best taken as suggestive of the general machinery. Interpolation of fine details can 
easily vary between implementations. 

Uncompressed mode is signalled by the occurence of the 10-bit extension code 
string 

00000011 11. B 

outside of any run-length make-up code or extension. Original unencoded image 
information follows. In this unencoded information, a 0-bit evidently signifies a 
white pixel, a 1-bit signifies a black pixel, and the TIFF Photometriclnterpretation 
will influence how these bits are mapped into any final uncompressed bitmap for 
use. The only modification made to the unencoded information is insertion of a 1- 
bit after every block of five consecutive 0-bits from the original image informa- 
tion. This is a transparency device that allows longer sequencences of 0-bits to be 
reserved for control conditions, especially ending the uncompressed-mode se- 
quence. When it is time to return to compressed mode, the 8-bit exit sequence 

OOOOOOlt.B 

is appended to the material. The 0-bits of the exit sequence are not considered in 
applying the 1 -bit insertion rule; up to four information 0-bits can legally precede 
the exit sequence. The trailing bit, 't,' specifies the color (via or 1) that is under- 
stood in the next run of compressed-mode encoding. This lets a color other than 
white be assumed for the 0-width pixel on the left of the edge between the last 
uncompressed pixel and the resumed 2-dimensional scan. 

Writers should confine uncompressed-mode sequences to the interiors of indi- 
vidual rows, never attempting to "wrap" from one row to the next. Readers must 
operate properly when the only encoding for a single row consists of an 
uncompressed-mode escape, a complete row of (proper 1 -inserted) uncompressed 
information, and the extension exit. Technically, the exit pixel, 't,' should prob- 
ably then be the opposite color of the last true pixel of the row, but readers should 
be generous in this case. 

In handling these complex encodings, the encounter of material from a defective 
source or a corrupted file is particularly unsettling and mysterious. Robust readers 
will do well to defend against falling off the end of the world; e.g., unexpected 
EOFB sequences should be handled, and attempted access to data bytes that are 
not within the bounds of the present segment (or the TIFF file itself) should be 
avoided. 
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Section 12: Document Storage and Retrieval 



These fields may be useful for document storage and retrieval applications. They 
will very likely be ignored by other applications. 



DocumentName 

The name of the document from which this image was scanned. 
Tag =269 (10D.H) 
Type = ASCII 
See also PageName. 

PageName 

The name of the page from which this image was scanned. 

Tag =285 (11D.H) 

Type = ASCII 

See also DocumentName. 

No default. 

PageNumber 

The page number of the page from which this image was scanned. 
Tag =297 (129.H) 
Type = SHORT 

N =2 

This field is used to specify page numbers of a multiple page (e.g. facsimile) docu- 
ment. PageNumber[0] is the page number; PageNumber[l] is the total number of 
pages in the document. If PageNumber[l] is 0, the total number of pages in the 
document is not available. 

Pages need not appear in numerical order. 

The first page is numbered (zero). 

No default. 

XPosition 

X position of the image. 
Tag =286 (11E.H) 
Type = RATIONAL 

N =1 
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The X offset in ResolutionUnits of the left side of the image, with respect to the 
left side of the page. 

No default. See also YPosition. 



YPosition 

Y position of the image. 
Tag =287 (11F.H) 
Type = RATIONAL 

N =1 

The Y offset in ResolutionUnits of the top of the image, with respect to the top of 
the page. In the TIFF coordinate scheme, the positive Y direction is down, so that 
YPosition is always positive. 

No default. See also XPosition. 



56 



TIFF 6.0 Specification 



Final — June 3, 1992 



Section 13: LZW Compression 



Restrictions 



Reference 



Characteristics 



This section describes TIFF compression scheme 5, an adaptive compression 
scheme for raster images. 



When LZW compression was added to the TIFF specification, in Revision 5.0, it 
was thought to be public domain. This is, apparently, not the case. 



The following paragraph has been approved by the Unisys Corporation: 

"The LZW compression method is said to be the subject of United States patent 
number 4,558,302 and corresponding foreign patents owned by the Unisys Cor- 
poration. Software and hardware developers may be required to license this patent 
in order to develop and market products using the TIFF LZW compression option. 
Unisys has agreed that developers may obtain such a license on reasonable, non- 
discriminatory terms and conditions. Further information can be obtained from: 
Welch Licensing Department, Office of the General Counsel, M/S C1SW19, 
Unisys Corporation, Blue Bell, Pennsylvania, 19424." 

Reportedly, there are also other companies with patents that may affect LZW 
implementors. 



Terry A. Welch, "A Technique for High Performance Data Compression", IEEE 
Computer, vol. 17 no. 6 (June 1984). Describes the basic Lempel-Ziv & Welch 
(LZW) algorithm in very general terms. The author's goal is to describe a hard- 
ware-based compressor that could be built into a disk controller or database en- 
gine and used on all types of data. There is no specific discussion of raster images. 
This section gives sufficient information so that the article is not required reading. 



LZW compression has the following characteristics: 

• LZW works for images of various bit depths. 

• LZW has a reasonable worst-case behavior. 

• LZW handles a wide variety of repetitive patterns well. 

• LZW is reasonably fast for both compression and decompression. 

• LZW does not require floating point software or hardware. 
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The Algorithm 



LZW is lossless. All information is preserved. But if noise or information is 
removed from an image, perhaps by smoothing or zeroing some low-order 
bitplanes, LZW compresses images to a smaller size. Thus, 5-bit, 6-bit, or 7-bit 
data masquerading as 8-bit data compresses better than true 8-bit data. Smooth 
images also compress better than noisy images, and simple images compress 
better than complex images. 

LZW works quite well on bilevel images, too. On our test images, it almost 
always beatPackBits and generally tied CCITT ID (Modified Huffman) com- 
pression. LZW also handles halftoned data better than most bilevel compres- 
sion schemes. 



Each strip is compressed independently. We strongly recommend that 
RowsPerStrip be chosen such that each strip contains about 8K bytes before com- 
pression. We want to keep the strips small enough so that the compressed and 
uncompressed versions of the strip can be kept entirely in memory, even on small 
machines, but are large enough to maintain nearly optimal compression ratios. 

The LZW algorithm is based on a translation table, or string table, that maps 
strings of input characters into codes. The TIFF implementation uses variable- 
length codes, with a maximum code length of 12 bits. This string table is different 
for every strip and does not need to be reatained for the decompressor. The trick is 
to make the decompressor automatically build the same table as is built when the 
data is compressed. We use a C-like pseudocode to describe the coding scheme: 
InitializeStringTable () ; 
WriteCode(ClearCode); 
0. = the empty string; 
for each character in the strip { 
K = GetNextCharacterO ; 
if fl+K is in the string table { 

Q. = fl+K; /* string concatenation */ 



} else { 

WriteCode (CodeFromString (Q.) ) ; 

AddTabieEntry(ft+K); 

fl = K; 

1 

} /* end of for loop */ 
WriteCode (CodeFromString (Q.) ) ; 
WriteCode (EndOf Information) ; 

That's it. The scheme is simple, although it is challenging to implement effi- 
ciently. But we need a few explanations before we go on to decompression. 

The "characters" that make up the LZW strings are bytes containing TIFF 
uncompressed (Compression=l) image data, in our implementation. For example, 
if BitsPerSample is 4, each 8-bit LZW character will contain two 4-bit pixels. If 
BitsPerSample is 16, each 16-bit pixel will span two 8-bit LZW characters. 

It is also possible to implement a version of LZW in which the LZW character 
depth equals BitsPerSample, as described in Draft 2 of Revision 5.0. But there is a 
major problem with this approach. If BitsPerSample is greater than 1 1 , we can not 
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use 1 2-bit-maximum codes and the resulting LZW table is unacceptably large. 
Fortunately, due to the adaptive nature of LZW, we do not pay a significant com- 
pression ratio penalty for combining several pixels into one byte before compress- 
ing. For example, our 4-bit sample images compressed about 3 percent worse, and 
our 1 -bit images compressed about 5 percent better. And it is easier to write an 
LZW compressor that always uses the same character depth than it is to write one 
that handles varying depths. 

We can now describe some of the routine and variable references in our 
pseudocode: 

InitializeStringTableO initializes the string table to contain all possible single- 
character strings. There are 256 of them, numbered through 255, since our char- 
acters are bytes. 

WriteCode() writes a code to the output stream. The first code written is a 
ClearCode, which is defined to be code #256. 

£2 is our "prefix string." 

GetNextCharacter() retrieves the next character value from the input stream. This 
will be a number between and 255 because our characters are bytes. 

The "+" signs indicate string concatenation. 

AddTableEntryO adds a table entry. (InitializeStringTableO has already put 256 
entries in our table. Each entry consists of a single-character string, and its associ- 
ated code value, which, in our application, is identical to the character itself. That 
is, the 0th entry in our table consists of the string <0>, with a corresponding code 
value of <0>, the 1st entry in the table consists of the string <1>, with a corre- 
sponding code value of <1> and the 255th entry in our table consists of the string 
<255>, with a corresponding code value of <255>.) So, the first entry that added 
to our string table will be at position 256, right? Well, not quite, because we re- 
serve code #256 for a special "Clear" code. We also reserve code #257 for a spe- 
cial "EndOflnformation" code that we write out at the end of the strip. So the first 
multiple-character entry added to the string table will be at position 258. 

For example, suppose we have input data that looks like this: 

Pixel 0:<7> 

Pixel 1:<7> 

Pixel2:<7> 

Pixel3:<8> 

Pixel4:<8> 

Pixel5:<7> 

Pixel6:<7> 

Pixel7:<6> 

Pixel8:<6> 

First, we read Pixel into K. QK is then simply <7>, because Q. is an empty string 
at this point. Is the string <7> already in the string table? Of course, because all 
single character strings were put in the table by InitializeStringTableO. So setQ 
equal to <7>, and then go to the top of the loop. 
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Read Pixel 1 into K. Does QK (<7><7>) exist in the string table? No, so we write 
the code associated with Q to output (write <7> to output) and add QK (<7><7>) 
to the table as entry 258. Store K (<7>) into Q. Note that although we have added 
the string consisting of Pixel and Pixel 1 to the table, we "re-use" Pixel 1 as the 
beginning of the next string. 

Back at the top of the loop, we read Pixel 2 into K. Does QK (<7><7>) exist in the 
string table? Yes, the entry we just added, entry 258, contains exactly <!><!>. So 
we add K to the end of Q so that Q is now <7><7>. 

Back at the top of the loop, we read Pixel 3 into K. Does QK (<7><7><8>) exist 
in the string table? No, so we write the code associated with Q (<258>) to output 
and then add QK to the table as entry 259. Store K (<8>) into Q. 

Back at the top of the loop, we read Pixel 4 into K. Does QK (<8><8>) exist in the 
string table? No, so we write the code associated with Q (<8>) to output and then 
add QK to the table as entry 260. Store K (<8>) into Q. 



Continuing, we get the following results: 



After reading: 


We wri 


Pixel 




Pixel 1 


<7> 


Pixel 2 




Pixel 3 


<258> 


Pixel 4 


<8> 


Pixel 5 


<8> 


Pixel 6 




Pixel 7 


<258> 


Pixel 8 


<6> 



And add table entry: 



258: <7><7> 

259: <7><7><8> 

260:<8><8> 

261:<8><7> 

262: <7><7><6> 
263: <6><6> 

WriteCode() also requires some explanation. In our example, the output code 
stream, <7><258><8><8><258><6> should be written using as few bits as pos- 
sible. When we are just starting out, we can use 9-bit codes, since our new string 
table entries are greater than 255 but less than 512. After adding table entry 511, 
switch to 10-bit codes (i.e., entry 512 should be a 10-bit code.) Likewise, switch to 
11-bit codes after table entry 1023, and 12-bit codes after table entry 2047. We 
will arbitrarily limit ourselves to 12-bit codes, so that our table can have at most 
4096 entries. The table should not be any larger. 

Whenever you add a code to the output stream, it "counts" toward the decision 
about bumping the code bit length. This is important when writing the last code 
word before an EOI code or ClearCode, to avoid code length errors. 

What happens if we run out of room in our string table? This is where the 
ClearCode comes in. As soon as we use entry 4094, we write out a (12-bit) 
ClearCode. (If we wait any longer to write the ClearCode, the decompressor 
might try to interpret the ClearCode as a 13-bit code.) At this point, the compres- 
sor reinitializes the string table and then writes out 9-bit codes again. 

Note that whenever you write a code and add a table entry, Q is not left empty. It 
contains exactly one character. Be careful not to lose it when you write an end-of- 
table ClearCode. You can either write it out as a 1 2-bit code before writing the 
ClearCode, in which case you need to do it right after adding table entry 4093, or 
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LZW Decoding 



you can write it as a 9-bit code after the ClearCode . Decompression gives the 
same result in either case. 

To make things a little simpler for the decompressor, we will require that each 
strip begins with a ClearCode and ends with an EndOflnformation code. Every 
LZW -compressed strip must begin on a byte boundary. It need not begin on a 
word boundary. LZW compression codes are stored into bytes in high-to-low- 
order fashion, i.e., FillOrder is assumed to be 1. The compressed codes are written 
as bytes (not words) so that the compressed data will be identical whether it is an 
'IF or 'MM' file. 

Note that the LZW string table is a continuously updated history of the strings that 
have been encountered in the data. Thus, it reflects the characteristics of the data, 
providing a high degree of adaptability. 



The procedure for decompression is a little more complicated: 

while ( (Code = GetNextCode () ) != EoiCode) { 
if (Code == ClearCode) ( 

InitializeTable () ; 
Code = GetNextCode (); 
if (Code == EoiCode) 

break; 
WriteString (StringFromCode (Code) ) ; 
OldCode = Code; 
} /* end of ClearCode case */ 
else ( 

if (IsInTable(Code) ) { 

WriteString (StringFromCode (Code) ) ; 
AddStringToTable (StringFromCode (OldCode 
) +FirstChar (StringFromCode (Code) ) ) ; 

OldCode = Code; 
} else { 

OutString = StringFromCode (OldCode) + 
FirstChar (StringFromCode (OldCode) ) ; 

WriteString (OutString) ; 
AddStringToTable (OutString) ; 
OldCode = Code; 
; 
) /* end of not-ClearCode case */ 
) /* end of while loop */ 

The function GetNextCode() retrieves the next code from the LZW-coded data. It 
must keep track of bit boundaries. It knows that the first code that it gets will be a 
9-bit code. We add a table entry each time we get a code. So, GetNextCode() must 
switch over to 10-bit codes as soon as string #570 is stored into the table. Simi- 
larly, the switch is made to 11-bit codes after #1022 and to 12-bit codes after 
#2046. 
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The function StringFromCode() gets the string associated with a particular code 
from the string table. 

The function AddStringToTable() adds a string to the string table. The "+" sign 
joining the two parts of the argument to AddStringToTable indicates string con- 
catenation. 

StringFromCodeO looks up the string associated with a given code. 

WriteStringO adds a string to the output stream. 



When SamplesPerPixel Is Greater Than 1 



So far, we have described the compression scheme as if SamplesPerPixel were 
always 1 , as is the case with palette-color and grayscale images. But what do we 
do with RGB image data? 



Tests on our sample images indicate that the LZW compression ratio is nearly 
identical whether PlanarConfiguration=l or PlanarConfiguration=2, for RGB 
images. So, use whichever configuration you prefer and simply compress the 
bytes in the strip. 

Note: Compression ratios on our test RGB images were disappointingly low: 
between 1.1 to 1 and 1.5 to 1, depending on the image. Vendors are urged to do 
what they can to remove as much noise as possible from their images. Preliminary 
tests indicate that significantly better compression ratios are possible with less- 
noisy images. Even something as simple as zeroing-outone or two least-signifi- 
cant bitp lanes can be effective, producing little or no perceptible image 
degradation. 



Implementation 



The exact structure of the string table and the method used to determine if a string 
is already in the table are probably the most significant design decisions in the 
implementation of a LZW compressor and decompressor. Hashing has been sug- 
gested as a useful technique for the compressor. We have chosen a tree-based 
approach, with good results. The decompressor is more straightforward and faster 
because no search is involved — strings can be accessed directly by code value. 



LZW Extensions 



Some images compress better using LZW coding if they are first subjected to a 
process wherein each pixel value is replaced by the difference between the pixel 
and the preceding pixel. See the following Section. 



62 



TIFF 6.0 Specification Final — June 3, 1992 



Acknowledgments 



See the first page of this section for the LZW reference. 

The use of ClearCode as a technique for handling overflow was borrowed from 
the compression scheme used by the Graphics Interchange Format (GIF), a small- 
color-paint-image-file format used by CompuServe that also uses an adaptation of 
the LZW technique. 
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Section 14: Differencing Predictor 



This section defines a Predictor that greatly improves compression ratios for some 
images. 

Predictor 

Tag =317(13D.H) 
Type = SHORT 

N =1 

A predictor is a mathematical operator that is applied to the image data before an 
encoding scheme is applied. Currently this field is used only with LZW (Com- 
pression=5) encoding because LZW is probably the only TIFF encoding scheme 
that benefits significantly from a predictor step. See Section 13. 

The possible values are: 
1 = No prediction scheme used before coding. 
2= Horizontal differencing. 

Default is 1 . 



The algorithm 



Make use of the fact that many continuous-tone images rarely vary much in pixel 
value from one pixel to the next. In such images, if we replace the pixel values by 
differences between consecutive pixels, many of the differences should be 0, plus 
or minus 1, and so on. This reduces the apparent information content and allows 
LZW to encode the data more compactly. 

Assuming 8-bit grayscale pixels for the moment, a basic C implementation might 
look something like this: 

char image [ ] [ ] ; 

int row, col; 

/* take horizontal differences: 
*/ 

for (row = 0; row < nrows; row++) 

for (col = ncols - 1; col >= 1; col — ) 

image [row] [col] -= image [row] [col-1]; 

If we don't have 8-bit components, we need to work a little harder to make better 
use of the architecture of most CPUs. Suppose we have 4-bit components packed 
two per byte in the normal TIFF uncompressed (i.e., Compression=l) fashion. To 
find differences, we want to first expand each 4-bit component into an 8-bit byte, 
so that we have one componentper byte, low-order justified. We then perform the 
horizontal differencing illustrated in the example above. Once the differencing 
has been completed, we then repack the 4-bit differences two to a byte, in the 
normal TIFF uncompressed fashion. 
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If the components are greater than 8 bits deep, expanding the components into 16- 
bit words instead of 8-bit bytes seems like the best way to perform the subtraction 
on most computers. 

Note that we have not lost any data up to this point, nor will we lose any data later 
on. It might seem at first that our differencing might turn 8-bit components into 9- 
bit differences, 4-bit components into 5-bit differences, and so on. But it turns out 
that we can completely ignore the "overflow" bits caused by subtracting a larger 
number from a smaller number and still reverse the process without error. Normal 
two's complement arithmetic does just what we want. Try an example by hand if 
you need more convincing. 

Up to this point we have implicitly assumed that we are compressing bilevel or 
grayscale images. An additional consideration arises in the case of color images. 

If PlanarConfiguration is 2, there is no problem. Differencing works the same as it 
does for grayscale data. 

If PlanarConfiguration is 1, however, things get a little trickier. If we didn't do 
anything special, we would subtract red component values from green component 
values, green component values from blue component values, and blue compo- 
nent values from red component values. This would not give the LZW coding 
stage much redundancy to work with. So, we will do our horizontal differences 
with an offset of SamplesPerPixel (3, in the RGB case). In other words, we will 
subtract red from red, green from green, and blue from blue. The LZW coding 
stage is identical to the SamplesPerPixel=l case. We require thatBitsPerSample 
be the same for all 3 components. 



Results and Guidelines 



LZW without differencing works well for 1 -bit images, 4-bit grayscale images, 
and many palette-color images. But natural 24-bit color images and some 8-bit 
grayscale images do much better with differencing. 



Although the combination of LZW coding with horizontal differencing does not 
result in any loss of data, it may be worthwhile in some situations to give up some 
information by removing as much noise as possible from the image data before 
doing the differencing, especially with 8-bit components. The simplest way to get 
rid of noise is to mask off one or two low-order bits of each 8-bit component. On 
our 24-bit test images, LZW with horizontal differencing yielded an average 
compression ratio of 1 .4 to 1 . When the low-order bit was masked from each 
component, the compression ratio climbed to 1.8 to 1; the compression ratio was 
2.4 to 1 when masking two bits, and 3.4 to 1 when masking three bits. Of course, 
the more you mask, the more you risk losing useful information along with the 
noise. We encourage you to experiment to find the best compromise for your 
device. For some applications, it may be useful to let the user make the final deci- 
sion. 

Incidentally, we tried taking both horizontal and vertical differences, but the extra 
complexity of two-dimensional differencing did not appear to pay off for most of 
our test images. About one third of the images compressed slightly better with 
two-dimensional differencing, about one third compressed slightly worse, and the 
rest were about the same. 
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Section 15: Tiled Images 



Introduction 



Motivation 

This section describes how to organize images into tiles instead of strips. 

For low-resolution to medium-resolution images, the standard TIFF method of 
breaking the image into strips is adequate. However high-resolution images can 
be accessed more efficiently — and compression tends to work better — if the im- 
age is broken into roughly square tiles instead of horizontally-wide but vertically- 
narrow strips. 



Relationship to existing fields 

When the tiling fields described below are used, they replace the 
StripOffsets, StripByteCounts, and RowsPerStrip fields. Use of tiles will 
therefore cause older TIFF readers to give up because they will have no way of 
knowing where the image data is or how it is organized. Do not use both strip- 
oriented and tile-oriented fields in the same TIFF file. 



Padding 

Tile size is defined by TileWidth and TileLength. All tiles in an image are the 
same size; that is, they have the same pixel dimensions. 

Boundary tiles are padded to the tile boundaries. For example, if TileWidth is 64 
and ImageWidth is 129, then the image is 3 tiles wide and 63 pixels of padding 
must be added to fill the rightmost column of tiles. The same holds for TileLength 
and ImageLength. It doesn't matter what value is used for padding, because good 
TIFF readers display only the pixels defined by ImageWidth and ImageLength 
and ignore any padded pixels. Some compression schemes work best if the pad- 
ding is accomplished by replicating the last column and last row instead of pad- 
ding with 0's. 

The price for padding the image out to tile boundaries is that some space is 
wasted. But compression usually shrinks the padded areas to almost nothing. 
Even if data is not compressed, remember that tiling is intended for large images. 
Large images have lots of comparatively small tiles, so that the percentage of 
wasted space will be very small, generally on the order of a few percent or less. 

The advantages of padding an image to the tile boundaries are that implementa- 
tions can be simpler and faster and that it is more compatible with tile-oriented 
compression schemes such as JPEG. See Section 22. 

Tiles are compressed individually, just as strips are compressed. That is, each row 
of data in a tile is treated as a separate "scanline" when compressing. Compres- 
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Fields 



sion includes any padded areas of the rightmost and bottom tiles so that all the 
tiles in an image are the same size when uncompressed. 

All of the following fields are required for tiled images: 



TileWidth 

Tag =322 (142.H) 

Type = SHORT or LONG 

N =1 

The tile width in pixels. This is the number of columns in each tile. 

Assuming integer arithmetic, three computed values that are useful in the follow- 
ing field descriptions are: 

TilesAcross = (ImageWidth + TileWidth - 1) / TileWidth 

TilesDown = (ImageLength + TileLength - 1) / TileLength 

TilesPerlmage = TilesAcross * TilesDown 

These computed values are not TIFF fields; they are simply values determined by 
the ImageWidth, TileWidth, ImageLength, and TileLength fields. 

TileWidth and ImageWidth together determine the number of tiles that span the 
width of the image (TilesAcross). TileLength and ImageLength together deter- 
mine the number of tiles that span the length of the image (TilesDown). 

We recommend choosing TileWidth and TileLength such that the resulting tiles 
are about 4K to 32K bytes before compression. This seems to be a reasonable 
value for most applications and compression schemes. 

TileWidth must be a multiple of 16. This restriction improves performance in 
some graphics environments and enhances compatibility with compression 
schemes such as JPEG. 

Tiles need not be square. 

Note that ImageWidth can be less than TileWidth, although this means that the 
tiles are too large or that you are using tiling on really small images, neither of 
which is recommended. The same observation holds for ImageLength and 
TileLength. 

No default. See also TileLength, TileOffsets, TileByteCounts. 



TileLength 

Tag =323 (143.H) 
Type = SHORT or LONG 

N =1 
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The tile length (height) in pixels. This is the number of rows in each tile. 

TileLength must be a multiple of 16 for compatibility with compression schemes 
such as JPEG. 

Replaces RowsPerStrip in tiled TIFF files. 

No default. See also TileWidth, TileOffsets, TileByteCounts. 

TileOffsets 

Tag =324 (144.H) 

Type = LONG 

N = TilesPerlmage for PlanarConfiguration = 1 

= SamplesPerPixel * TilesPerlmage for PlanarConfiguration = 2 

For each tile, the byte offset of that tile, as compressed and stored on disk. The 
offset is specified with respect to the beginning of the TIFF file. Note that this 
implies that each tile has a location independent of the locations of other tiles. 

Offsets are ordered left-to-right and top-to-bottom. For PlanarConfiguration = 2, 
the offsets for the first component plane are stored first, followed by all the offsets 
for the second component plane, and so on. 

No default. See also TileWidth, TileLength, TileByteCounts. 

TileByteCounts 

Tag =325 (145.H) 

Type = SHORT or LONG 

N = TilesPerlmage for PlanarConfiguration = 1 

= SamplesPerPixel * TilesPerlmage for PlanarConfiguration = 2 
For each tile, the number of (compressed) bytes in that tile. 
See TileOffsets for a description of how the byte counts are ordered. 
No default. See also TileWidth, TileLength, TileOffsets. 
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Section 16: CMYK Images 



Motivation 



This section describes how to store separated (usually CMYK) image data in a 
TIFF file. 

In a separated image, each pixel consists of N components. Each component 
represents the amount of a particular ink that is to be used to represent the image at 
that location, typically using a halftoning technique. 

For example, in a CMYK image, each pixel consists of 4 components. Each com- 
ponent represents the amount of cyan, magenta, yellow, or black process ink that 
is to be used to represent the image at that location. 

The fields described in this section can be used for more than simple 4-color pro- 
cess (CMYK) printing. They can also be used for describing an image made up of 
more than 4 inks, such an image made up of a cyan, magenta, yellow, red, green, 
blue, and black inks. Such an image is sometimes called a high-fidelity image and 
has the advantage of slightly extending the printed color gamut. 

Since separated images are quite device-specific and are restricted to color pre- 
press use, they should not be used for general image data interchange. Separated 
images are to be used only for prepress applications in which the imagesetter, 
paper, ink, and printing press characteristics are known by the creator of the sepa- 
rated image. 

Note: there is no single method of converting RGB data to CMYK data and back. 
In a perfect world, something close to cyan = 255-red, magenta = 255-green, and 
yellow = 255-blue should work; but characteristics of printing inks and printing 
presses, economics, and the fact that the meaning of RGB itself depends on other 
parameters combine to spoil this simplicity. 



Requirements 



In addition to satisfying the normal Baseline TIFF requirements, a separated TIFF 
file must have the following characteristics: 

• SamplesPerPixel = N. SHORT. The number of inks. (For example, N=4 for 
CMYK, because we have one component each for cyan, magenta, yellow, and 
black.) 

• BitsPerSample = 8,8,8,8 (for CMYK). SHORT. For now, only 8-bit compo- 
nents are recommended. The value "8" is repeated SamplesPerPixel times. 

• Photometriclnterpretation = 5 (Separated - usually CMYK). SHORT. 

The components represent the desired percent dot coverage of each ink, where 
the larger component values represent a higher percentage of ink dot coverage 
and smaller values represent less coverage. 
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Fields 



In addition, there are some new fields, all of which are optional. 

InkSet 

Tag =332(14C.H) 

Type = SHORT 

N =1 

The set of inks used in a separated (PhotometricInterpretation=5) image. 

1 = CMYK. The order of the components is cyan, magenta, yellow, black. Usually, a 

value of represents 0% ink coverage and a value of 255 represents 100% ink 
coverage for that component, but see DotRange below. The InkNames field 
should not exist when InkSet= 1 . 

2 = not CMYK. See the InkNames field for a description of the inks to be used. 

Default is 1 (CMYK). 

NumberOflnks 

Tag =334(14E.H) 
Type = SHORT 

N =1 

The number of inks. Usually equal to SamplesPerPixel, unless there are extra 
samples. 

See also ExtraSamples. 

Default is 4. 

InkNames 

Tag =333(14D.H) 
Type = ASCII 

N = total number of characters in all the ink name strings, including the 

NULs. 

The name of each ink used in a separated (PhotometricInterpretation=5) image, 
written as a list of concatenated, NUL-terminated ASCII strings. The number of 
strings must be equal to NumberOflnks. 

The samples are in the same order as the ink names. 

See also InkSet, NumberOflnks. 

No default. 
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DotRange 

Tag =336(150.H) 
Type = BYTE or SHORT 

N = 2, or 2*SamplesPerPixel 

The component values that correspond to a 0% dot and 100% dot. DotRange[0] 
corresponds to a 0% dot, and DotRange[l] corresponds to a 100% dot. 

If a DotRange pair is included for each component, the values for each component 
are stored together, so that the pair for Cyan would be first, followed by the pair 
for Magenta, and so on. Use of multiple dot ranges is, however, strongly discour- 
aged in the interests of simplicity and compatibility with ANSI IT8 standards. 

A number of prepress systems like to keep some "headroom" and "footroom" on 
both ends of the range. What to do with components that are less than the 0% aim 
point or greater than the 100% aim point is not specified and is application-depen- 
dent. 

It is strongly recommended that a CMYK TIFF writer not attempt to use this field 
to reverse the sense of the pixel values so that smaller values mean more ink in- 
stead of less ink. That is, DotRange[0] should be less than DotRange[l]. 

DotRange[0] and DotRange[l] must be within the range [0, (2**BitsPerSample) - 
1]. 

Default: a component value of corresponds to a 0% dot, and a component value 
of 255 (assuming 8-bit pixels) corresponds to a 100% dot. That is, DotRange[0] = 
andDotRange[l] = (2**BitsPerSample) - 1. 

TargetPrinter 

Tag =337(151.H) 
Type = ASCII 

N =any 

A description of the printing environment for which this separation is intended. 



History 



This Section has been expanded from earlier drafts, with the addition of the 
InkSet, InkNames, NumberOflnks, DotRange, and TargetPrinter, but is 

backward-compatible with earlier draft versions. 

Possible future enhancements: definition of the characterization information so 
that the CMYK data can be retargeted to a different printing environment and so 
that display on a CRT or proofing device can more accurately represent the color. 
ANSI IT8 is working on such a proposal. 
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Section 17: HalftoneHints 



This section describes a scheme for properly placing highlights and shadows in 
half toned images. 



Introduction 



The single most easily recognized failing of continuous tone images is the incor- 
rect placement of highlight and shadow. It is critical that a halftone process be 
capable of printing the lightest areas of the image as the smallest halftone spot 
capable of the output device, at the specified printer resolution and screen ruling. 
Specular highlights (small ultra-white areas) as well as the shadow areas should 
be printable as paper only. 



Consistency in highlight and shadow placement allows the user to obtain predict- 
able results on a wide variety of halftone output devices. Proper implementation 
of theHalftoneHints field will provide a significant step toward device indepen- 
dent imaging, such that low cost printers may to be used as effective proofing 
devices for images which will later be halftoned on a high-resolution imagesetter. 



The HalftoneHints Field 



HalftoneHints 

Tag =321(141.H) 
Type = SHORT 

N =2 

The purpose of the HalftoneHints field is to convey to the halftone function the 
range of gray levels within a colorimetrically-specified image that should retain 
tonal detail. The field contains two values of sixteen bits each and, therefore, is 
contained wholly within the field itself; no offset is required. The first word speci- 
fies the highlight gray level which should be halftoned at the lightest printable tint 
of the final output device. The second word specifies the shadow gray level which 
should be halftoned at the darkest printable tint of the final output device. Portions 
of the image which are whiter than the highlight gray level will quickly, if not 
immediately, fade to specular highlights. There is no default value specified, since 
the highlight and shadow gray levels are a function of the subject matter of a par- 
ticular image. 

Appropriate values may be derived algorithmically or may be specified by the 
user, either directly or indirectly. 

The HalftoneHints field, as defined here, defines an achromatic function. It can be 
used just as effectively with color images as with monochrome images. When 
used with opponent color spaces such as CIE L*a*b* or YCbCr, it refers to the 
achromatic component only; L* in the case of CIELab, and Y in the case of 
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YCbCr. When used with tri-stimulus spaces such as RGB, it suggests to retain 
tonal detail for all colors with an NTSC gray component within the bounds of the 
R=G=B=Highlight to R=G=B=Shadow range. 



Comments for TIFF Writers 



TIFF writers are encouraged to include the HalftoneHints field in all color or 
grayscale images where BitsPerSample >1. Although no default value is speci- 
fied, prior to the introduction of this field it has been common practice to implic- 
itly specify the highlight and shadow gray levels as 1 and 2**BitsperSample-2 
and manipulate the image data to this definition. There are some disadvantages to 
this technique, and it is not feasible for a fixed gamut colorimetric image type. 
Appropriate values may be derived algorithmically or may be specified by the 
user directly or indirectly. Automatic algorithms exist for analyzing the histogram 
of the achromatic intensity of an image and defining the minimum and maximum 
values as the highlight and shadow settings such that tonal detail is retained 
throughout the image. This kind of algorithm may try to impose a highlight or 
shadow where none really exists in the image, which may require user controls to 
override the automatic setting. 



It should be noted that the choice of the highlight and shadow values is somewhat 
output dependent. For instance, in situations where the dynamic range of the 
output medium is very limited (as in newsprint and, to a lesser degree, laser out- 
put), it may be desirable for the user to clip some of the lightest or darkest tones to 
avoid the reduced contrast resulting from compressing the tone of the entire im- 
age. Different settings might be chosen for 150-line halftone printed on coated 
stock. Keep in mind that these values may be adjusted later (which might not be 
possible unless the image is stored as a colorimetric, fixed, full-gamut image), and 
that more sophisticated page-layout applications may be capable of presenting a 
user interface to consider these decisions at a point where the halftone process is 
well understood. 

It should be noted that although CCDs are linear intensity detectors, TIFF writers 
may choose to manipulate the image to store gamma-compensated data. Gamma- 
compensated data is more efficient at encoding an image than is linear intensity 
data because it requires fewer BitsPerPixel to eliminate banding in the darker 
tones. It also has the advantage of being closer to the tone response of the display 
or printer and is, therefore, less likely to produce poor results from applications 
that are not rigorous about their treatment of images. Be aware that the 
Photometriclnterpretation value of or 1 (grayscale) implies linear data because 
no gamma is specified. The Photometriclnterpretation value of 2 (RGB data) 
specifies the NTSC gamma of 2.2 as a default. If data is written as something 
other than the default, then a GrayResponseCurve field or a TransferFunction 
field must be present to define the deviation. For grayscale data, be sure that the 
densities in the GrayResponseCurve are consistent with the 
Photometriclnterpretation field and the HalftoneHints field. 
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Comments for TIFF Readers 



TIFF readers that send a grayscale image to a halftone output device, whether it is 
a binary laser printer or a PostScript imagesetter should make an effort to maintain 
the highlight and shadow placement. This requires two steps. First, determine the 
highlight and shadow gray level of a particular image. Second, communicate that 
information to the halftone engine. 

To determine the highlight and shadow gray levels, begin by looking for a 
HalftoneHints field. If it exists, it takes precedence. The first word represents the 
gray level of the highlight and the second word represents the gray level of the 
shadow. If the image is a colorimetric image (i.e. it has a GrayResponseCurve 
field or a TransferFunction field) but does not contain a HalftoneHints field, then 
the gamut mapping techniques described earlier should be used to determine the 
highlight and shadow values. If neither of these conditions are true, then the file 
should be treated as if a HalftoneHints field had indicated a highlight at gray level 
1 and a shadow at gray level 2**BitsPerPixel-2 (or vice-versa depending on the 
Photometriclnterpretation field). Once the highlight and shadow gray levels have 
been determined, the next step is to communicate this information to the halftone 
module. The halftone module may exist within the same application as the TIFF 
reader, it may exist within a separate printer driver, or it may exist within the 
Raster Image Processor (RIP) of the printer itself. Whether the halftone process is 
a simple dither pattern or a general purpose spot function, it has some gray level at 
which the lightest printable tint will be rendered. The HalftoneHint concept is best 
implemented in an environment where this lightest printable tint is easily and 
consistently specified. 

There are several ways in which an application can communicate the highlight 
and shadow to the halftone function. Some environments may allow the applica- 
tion to pass the highlight and shadow to the halftone module explicitly along with 
the image. This is the best approach, but many environments do not yet provide 
this capability. Other environments may provide fixed gray levels at which the 
highlight and shadow will be rendered. For these cases, the application should 
build a tone map that matches the highlight and shadow specified in the image to 
the highlight and shadow gray level of the halftone module. This approach re- 
quires more work by the application software, but will provide excellent results. 
Some environments will not have any consistent concept of highlight and shadow 
at all. In these environments, the best an application can do is characterize each of 
the supported printers and save the observed highlight and shadow gray levels. 
The application can then use these values to achieve the desired results, providing 
the environment doesn't change. 

Once the highlight and shadow areas are selected, care should be taken to appro- 
priately map intermediate gray levels to those expected by the halftone engine, 
which may or may not be linear Reflectance. Note that although CCDs are linear 
intensity detectors and many TIFF files are stored as linear intensity, most output 
devices require significant tone compensation (sometimes called gamma correc- 
tion) to correctly display or print linear data. Be aware that the 
Photometriclnterpretation value of 0, 1 implies linear data because no gamma is 
specified. The Photometriclnterpretation value of 2 (RGB data) specifies the 
NTSC gamma of 2.2 as a default. If a GrayResponseCurve field or a 
TransferFunction field is present, it may define something other than the default. 
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Some Background on the Halftone Process 

To obtain the best results when printing a continuous-tone raster image, it is sel- 
dom desirable to simply reproduce the tones of the original on the printed page. 
Most often there is some gamut mapping required. Often this is because the tonal 
range of the original extends beyond the tonal range of the output medium. In 
some cases, the tone range of the original is within the gamut of the output me- 
dium, but it may be more pleasing to expand the tone of the image to fill the range 
of the output. Given that the tone of the original is to be adjusted, there is a whole 
range of possibilities for the level of sophistication that may be undertaken by a 
software application. 

Printing monochrome output is far less sophisticated than printing color output. 
For monochrome output the first priority is to control the placement of the high- 
light and the shadow. Ideally, a quality halftone will have sufficient levels of gray 
so that a standard observer cannot distinguish the interface between any two adja- 
cent levels of gray. In practice, however, there is often a significant step between 
the tone of the paper and the tone of the lightest printable tint. Although usually 
less severe, the problem is similar between solid ink and the darkest printable tint. 
Since the dynamic range between the lightest printable tint and the darkest print- 
able tint is usually less than one would like, it is common to maximize the tone of 
the image within these bounds. Not all images will have a highlight (an area of the 
image which is desirable to print as light as possible while still retaining tonal 
detail). If one exists, it should be carefully controlled to print at the lightest print- 
able tint of the output medium. Similarly, the darkest areas of the image to retain 
tonal detail should be printed as the darkest printable tint of the output medium. 
Tones lighter or darker than these may be clipped at the limits of the paper and 
ink. Satisfactory results may be obtained in monochrome work by doing nothing 
more than a perceptually -linear mapping of the image between these rigorously 
controlled endpoints. This level of sophistication is sufficient for many mid-range 
applications, although the results often appear flatter (i.e. lower in contrast) than 
desired. 

The next step is to increase contrast slightly in the tonal range of the image that 
contains the most important subject matter. To perform this step well requires 
considerably more information about the image and about the press. To know 
where to add contrast, the algorithm must have access to first the keyness of the 
image; the tone range which the user considers most important. To know how 
much contrast to add, the algorithm must have access to the absolute tone of the 
original and the dynamic range of the output device so that it may calculate the 
amount of tone compression to which the image is actually subjected. 

Most images are called normal key. The important subject areas of a normal key 
image are in the midtones. These images do well when a so-called "sympathetic 
curve" is applied, which increases the contrast in midtones slightly at the expense 
of contrast in the lighter and darker tones. White china on a white tablecloth is an 
example of a high key image. High key images benefit from higher contrast in 
lighter tones, with less contrast needed in the midtones and darker tones. Low key 
images have important subject matter in the darker tones and benefit from increas- 
ing the contrast in the darker tones. Specifying the keyness of an image might be 
attempted by automatic techniques, but it will likely fail without user input. For 
example, a photo of a bride in a white wedding dress it may be a high key image if 
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you are selling wedding dresses, but may be a normal key image if you are the 
parents of the bride and are more interested in her smile. 

Sophisticated color reproduction employs all of these principles, and then applies 
them in three dimensions. The mapping of the highlight and shadow becomes 
only one small, albeit critical, portion of the total issue of mapping colors that are 
too saturated for the output medium. Here again, automatic techniques may be 
employed as a first pass, with the user becoming involved in the clip or compress 
mapping decision. The HalftoneHints field is still useful in communicating which 
portions of the intensity of the image must be retained and which may be clipped. 
Again, a sophisticated application may override these settings if later user input is 
received. 
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Section 18: Associated Alpha Handling 



Introduction 



Fields 



This section describes a scheme for handling images with alpha data. 



A common technique in computer graphics is to assemble an image from one or 
more elements that are rendered separately. When elements are combined using 
compositing techniques, matte or coverage information must be present for each 
pixel to create a properly anti-aliased accumulation of the full image [Porter84]. 
This matting information is an example of additional per -pixel data that must be 
maintained with an image. This section describes how to use the ExtraSamples 
field to store the requisite matting information, commonly called the associated 
alpha or just alpha. This scheme enables efficient manipulation of image data 
during compositing operations. 



Images with matting information are stored in their natural format but with an 
additional component per pixel. The ExtraSample field is included with the image 
to indicate that an extra component of each pixel contains associated alpha data. In 
addition, when associated alpha data are included with RGB data, the RGB com- 
ponents must be stored premultiplied by the associated alpha component and 
component values in the range [0,2**BitsPerSample-l] are implicitly mapped 
onto the [0,1] interval. That is, for each pixel (r,g,b) and opacity A, where r, g, b, 
and A are in the range [0,1], (A*r,A*g,A*b,A) must be stored in the file. If A is 
zero, then the color components should be interpreted as zero. Storing data in this 
pre-multiplied format, allows compositing operations to be implemented most 
efficiently. In addition, storing pre-multiplied data makes it possible to specify 
colors with components outside the normal [0,1] interval. The latter is useful for 
defining certain operations that effect only the luminescence [Porter84] . 



ExtraSamples 

Tag =338(152.11) 
Type = SHORT 

N =1 

This field must have a value of 1 (associated alpha data with pre-multiplied color 
components). The associated alpha data stored in component SamplesPerPixel-1 
of each pixel contains the opacity of that pixel, and the color information is pre- 
multiplied by alpha. 
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Comments 



Associated alpha data is just another component added to each pixel. Thus, for 
example, its size is defined by the value of the BitsPerSample field. 

Note that since data is stored with RGB components already multiplied by alpha, 
naive applications that want to display an RGB A image on a display can do so 
simply by displaying the RGB component values. This works because it is effec- 
tively the same as merging the image with a black background. That is, to merge 
one image with another, the color of resultant pixels are calculated as: 

C = C *A +C , *(1-A ) 

r aver over under v crer' 

Since the "under image" is a black background, this equation reduces to 
C = C *A 

r over o v e r 

which is exactly the pre-multiplied color; i.e. what is stored in the image. 

On the other hand, to print an RGBA image, one must composite the image over a 
suitable background page color. For a white background, this is easily done by 
adding 1 - A to each color component. For an arbitrary background color C h t , the 
printed color of each pixel is 

C = C + C * (1-A ) 

pnnj lirt age back v im age' 

(since C. t is pre-multiplied). 

Since the ExtraSamples field is independent of other fields, this scheme permits 
alpha information to be stored in whatever organization is appropriate. In particu- 
lar, components can be stored packed (PlanarConfiguration=l); this is important 
for good I/O performance and for good memory access performance on machines 
that are sensitive to data locality. However, if this scheme is used, TIFF readers 
must not derive the SamplesPerPixel from the value of the 
Photometriclnterpretation field (e.g., if RGB, then SamplesPerPixel is 3). 

In addition to being independent of data storage-related fields, the field is also 
independent of the Photometriclnterpretation field. This means, for example, that 
it is easy to use this field to specify grayscale data and associated matte informa- 
tion. Note that a palette-color image with associated alpha will not have the 
colormap indices pre-multiplied; rather, the RGB colormap values will be pre- 
multiplied. 

Unassociated Alpha and Transparency Masks 

Some image manipulation applications support notions of transparency masks 
and soft-edge masks. The associated alpha information described in this section is 
different from this unassociated alpha information in many ways, most impor- 
tantly: 

• Associated alpha describes opacity or coverage at each pixel, while clipping- 
related alpha information describes a boolean relationship. That is, associated 
alpha can specify fractional coverage at a pixel, while masks specify either or 
100 percent coverage. 

• Once defined, associated alpha is not intended to be removed or edited, except 
as a result of compositing the image; it is an integral part of an image. 
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Unassociated alpha, on the other hand, is designed as an ancillary piece of 
information. 
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Section 19: Data Sample Format 



Fields 



This section describes a scheme for specifying data sample type information. 

TIFF implicitly types all data samples as unsigned integer values. Certain applica- 
tions, however, require the ability to store image-related data in other formats 
such as floating point. This section presents a scheme for describing a variety of 
data sample formats. 



SampleFormat 

Tag =339(153.H) 

Type = SHORT 

N = SamplesPerPixel 

This field specifies how to interpret each data sample in a pixel. Possible values 
are: 

1 = unsigned integer data 

2 = two ' s complement signed integer data 

3 = IEEE floating point data [IEEE] 

4 = undefined data format 

Note that the SampleFormat field does not specify the size of data samples; this is 
still done by the BitsPerSample field. 

A field value of "undefined" is a statement by the writer that it did not know how 
to interpret the data samples; for example, if it were copying an existing image. A 
reader would typically treat an image with "undefined" data as if the field were 
not present (i.e. as unsigned integer data). 

Default is 1 , unsigned integer data. 

SM in Sample Value 

Tag =340(154.H) 

Type = the field type that best matches the sample data 

N = SamplesPerPixel 

This field specifies the minimum sample value. Note that a value should be given 
for each data sample. That is, if the image has 3 SamplesPerPixel, 3 values must 
be specified. 

The default for SMinSampleValue and SMaxSampleValue is the full range of the 
data type. 
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SMaxSample Value 

Tag =341(155.H) 

Type = the field type that best matches the sample data 

N = SamplesPerPixel 

This new field specifies the maximum sample value. 

Comments 

The SampleFormat field allows more general imaging (such as image processing) 
applications to employ TIFF as a valid file format. 

SMinSampleValue and SMaxSampleValue become more meaningful when im- 
age data is typed. The presence of these fields makes it possible for readers to 
assume that data samples are bound to the range [SMinSampleValue, 
SMaxSampleValue] without scanning the image data. 
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Section 20: RGB Image Colorimetry 



Introduction 



Without additional information, RGB data is device-specific; that is, without an 
absolute color meaning. This section describes a scheme for describing and char- 
acterizing RGB image data. 



Color printers, displays, and scanners continue to improve in quality and avail- 
ability while they drop in price. Now the problem is to display color images so 
that they appear to be identical on different hardware. 

The key to reproducing the same color on different devices is to use the CIE 193 1 
XYZ color-matching functions, the international standard for color comparison. 
Using CIE XYZ, an image's colorimetry information can fully describe its color 
interpretation. The approach taken here is essentially calibrated RGB . It implies a 
transformation from the RGB color space of the pixels to CIE 1931 XYZ. 

The appearance of a color depends not only on its absolute tristimulus values, but 
also on the conditions under which it is viewed, including the nature of the sur- 
round and the adaptation state of the viewer. Colors having the same absolute 
tristimulus values appear the same in identical viewing conditions. The more 
complex issue of color appearance under different viewing conditions is ad- 
dressed by [4]. The colorimetry information presented here plays an important 
role in color appearance under different viewing conditions. 

Assuming identical viewing conditions, an application using the tags described 
below can display an image on different hardware and achieve colorimetrically 
identical results. The process of using this colorimetry information for displaying 
an image is straightforward on a color monitor but it is more complex for color 
printers. Also, the results will be limited by the color gamut and other characteris- 
tics of the display or printing device. 

The following fields describe the image colorimetry information of a TIFF image: 

WhitePoint chromaticity of the white point of the image 

PrimaryChromaticities chromaticities of the primaries of the image 

TransferF unction transfer function for the pixel data 

TransferRange extends the range of the transfer function 

ReferenceBlackWhite pixel component headroom and footroom parameters 

The TransferFunction, TransferRange, and ReferenceBlackWhite fields have 
defaults based on industry standards. An image has a colorimetric interpretation if 
and only if both the WhitePoint and PrimaryChromaticities fields are present. An 
image without these colorimetry fields will be displayed in an application and 
hardware dependent manner. 

Note: In the following definitions, BitsPerSample is used as if it were a single 
number when in fact it is an array of SamplesPerPixel numbers. The elements of 
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this array may not always be equal, for example: 5/6/5 16-bit pixels. 
BitsPerSample should be interpreted as the BitsPerSample value associated with a 
particular component. In the case of unequal BitsPerSample values, the defini- 
tions below can be extended in a straightforward manner. 

This section has the following differences with Appendix H in TIFF 5.0: 

• removed the use of image colorimetry defaults 

• renamed the ColorResponseCurves field as TransferFunction 

• optionally allowed a single TransferFunction table to describe all three chan- 
nels 

• described the use of the TransferFunction field for YCbCr, Palette, 
WhitelsZero and BlacklsZero Photometriclnterpretation types 

• added the TransferRange tag to expand the range of the TransferFunction 
below black and above white 

• added the ReferenceBlackWhite field 

• addressed the issue of color appearance 



Colorimetry Field Definitions 



WhitePoint 

Tag =318(13E.H) 
Type = RATIONAL 

N = 2 

The chromaticity of the white point of the image. This is the chromaticity when 
each of the primaries has its ReferenceWhite value. The value is described using 
the 193 1 CIE xy chromaticity diagram and only the chromaticity is specified. 
This value can correspond to the chromaticity of the alignment white of a monitor, 
the filter set and light source combination of a scanner or the imaging model of a 
rendering package. The ordering is white[x], white[y]. 

For example, the CIE Standard Illuminant D65 used by CCIR Recommendation 
709 and Kodak Photo YCC is: 

3127/10000,3290/10000 

No default. 



PrimaryChromaticities 

Tag =319(13F.H) 
Type = RATIONAL 

N =6 
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The chromaticities of the primaries of the image. This is the chromaticity for each 
of the primaries when it has its ReferenceWhite value and the other primaries 
have their ReferenceBlack values. These values are described using the 193 1 CIE 
xy chromaticity diagram and only the chromaticities are specified. These values 
can correspond to the chromaticities of the phosphors of a monitor, the filter set 
and light source combination of a scanner or the imaging model of a rendering 
package. The ordering is red[x], red[y], green[x], green[y], blue[x], and blue[y]. 

For example the CCIR Recommendation 709 primaries are: 

640/1000,330/1000, 

300/1000,600/1000, 

150/1000, 60/1000 
No default. 

TransferFunction 

Tag =301 (12D.H) 

Type = SHORT 

N = { 1 or 3 } * ( 1 « BitsPerSample) 

Describes a transfer function for the image in tabular style. Pixel components can 
be gamma-compensated, companded, non-uniformly quantized, or coded in some 
other way. The TransferFunction maps the pixel components from a non-linear 
BitsPerSample (e.g. 8-bit) form into a 16-bit linear form without a perceptible loss 
of accuracy. 

If N = 1 « BitsPerSample, the transfer function is the same for each channel and 
all channels share a single table. Of course, this assumes that each channel has the 
same BitsPerSample value. 

If N = 3 * (1 « BitsPerSample), there are three tables, and the ordering is the 
same as it is for pixel components of the Photometriclnterpretation field. These 
tables are separate and not interleaved. For example, with RGB images all red 
entries come first, followed by all green entries, followed by all blue entries. 

The length of each component table is 1 « BitsPerSample. The width of each 
entry is 16 bits as implied by the type SHORT. Normally the value represents 
the minimum intensity and 65535 represents the maximum intensity and the val- 
ues [0, 0, 0] represent black and [65535,65535, 65535] represent white. If the 
TransferRange tag is present then it is used to determine the minimum and maxi- 
mum values, and a scaling normalization. 

The TransferFunction can be applied to images with a Photometriclnterpretation 
value of RGB, Palette, YCbCr, WhitelsZero, and BlacklsZero. The 
TransferFunction is not used with other Photometriclnterpretation types. 

For RGB Photometriclnterpretation, ReferenceBlackWhite expands the coding 
range, TransferRange expands the range of the TransferFunction, and the 
TransferFunction tables decompand the RGB value. The WhitePoint and 
PrimaryChromaticities further describe the RGB colorimetry. 
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For Palette color Photometric Interpretation, the Colormap maps the pixel into 
three 16-bit values that when scaled to BitsPerSample-bits serve as indices into 
the TransferFunction tables which decompand the RGB value. The WhitePoint 
and PrimaryChromaticities further describe the underlying RGB colorimetry. 

A Palette value can be scaled into a TransferFunction index by: 

index= (value * ((1 « BitsPerSample) - 1)) / 65535; 
A TransferFunction index can be scaled into a Palette color value by: 

value= (index * 65535L) / ((1 « BitsPerSample) - 1); 

Be careful if you intend to create Palette images with a TransferFunction. If the 
Colormap tag is directly converted from a hardware colormap, it may have a 
device gamma already incorporated into the D AC values. 

For YCbCr Photometriclnterpretation, ReferenceBlackWhite expands the coding 
range, the YCbCrCoefficients describe the decoding matrix to transform YCbCr 
into RGB, TransferRange expands the range of the TransferFunction, and the 
TransferFunction tables decompand the RGB value. The WhitePoint and 
PrimaryChromaticities fields provide further description of the underlying RGB 
colorimetry. 

After coding range expansion by ReferenceBlackWhite and TransferFunction 
expansion by TransferRange, RGB values may be outside the domain of the 
TransferFunction. Also, the display device matrix can transform RGB values into 
display device RGB values outside the domain of the device. These values are 
handled in an application-dependent manner. 

For RGB images with non-default ReferenceBlackWhite coding range expansion 
and for YCbCr images, the resolution of the TransferFunction may be insuffi- 
cient. For example, after the YCbCr transformation matrix, the decoded RGB 
values must be rounded to index into the TransferFunction tables. Applications 
needing the extra accuracy should interpolate between the elements of the 
TransferFunction tables. Linear interpolation is recommended. 

For WhitelsZero and BlacklsZero Photometriclnterpretation, the 
TransferFunction decompands the grayscale pixel value to a linear 16-bit form. 
Note that a TransferFunction value of represents black and 65535 represents 
white regardless of whether a grayscale image is WhitelsZero or BlacklsZero. 
For example, the zeroth element of a WhitelsZero TransferFunction table will 
likely be 65535. This extension of the TransferFunction field for grayscale im- 
ages is intended to replace the GrayResponseCurve field. 

The TransferFunction does not describe a transfer characteristic outside of the 
range for ReferenceBlackWhite. 

Default is a single table corresponding to the NTSC standard gamma value of 2.2. 
This table is used for each channel. It can be generated by: 

NValues = 1 << BitsPerSample; 

for (TF[0]= 0, i = 1; i < NValues; i++) 

TF[i]= floor(pow(i / (NValues - 1.0), 2.2) * 65535 + 0.5); 
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TransferRange 

Tag =342(156.H) 

Type = SHORT 

N =6 

Expands the range of the TransferFunction. The first value within a pair is associ- 
ated with TransferBlack and the second is associated with TransferWhite. The 
ordering of pairs is the same as for pixel components of the 

Photometriclnterpretation type. By default, theTransferFunction is defined over a 
range from a minimum intensity, or nominal black, to a maximum intensity,(l 
« BitsPerSample) - 1 or nominal white. Kodak Photo YCC uses an extended 
range TransferFunction in order to describe highlights, saturated colors and 
shadow detail beyond this range. The TransferRange expands the 
TransferFunction to support these values. It is defined only for RGB and YCbCr 
Photometriclnterpretations. 

After ReferenceBlackWhite and/or YCbCr decoding has taken place, an RGB 
value can be represented as a real number. It is then rounded to create an index 
into the TransferFunctiontable. In the absence of a TransferRange tag, or if the 
tag has the default values, the rounded value is an index and the normalized inten- 
sity value is: 

index = (int) (value + (value < 0.0? -0.5 : 0.5)); 

intensity = TF [index] / 65535; 

If the TransferRange tag is present and has non-default values, it provides an 
offset to be used with the rounded index. It also describes a scaling. The normal- 
ized intensity value is: 

index = (int) (value + (value < 0.0? -0.5 : 0.5)); 
intensity = (TF [index + TransferRange [Black] ] - 

TF [TransferRange [Black] ] ) 

/ (TF [TransferRange [White] ] - TF [TransferRange [Black] ]) ; 

An application can write a TransferFunction with a non-defaultTransferRange as 
follows: 

black_offset = scale_f actor * Transfer (-TransferRange [Black] ar / 

(TransferRange [White] - TransferRange [Black] )) ; 
for (i = 0; i < (1 << BitsPerSample); i++) 

TF[i] = floor (0.5 - black_offset + scale_factor 
* Transfer ( (i - TransferRange [Black] ) 
/ (TransferRange [White] - TransferRange [Black] ))) ; 

The TIFF writer chooses scale_factor such that the TransferFunction fits into a 
16-bit unsigned short, and chooses the TransferRange so that the most important 
part of the TransferFunction fits into the table. 

Default is [0, NV, 0, NV, 0, NV] where NV = (1 «BitsPerSample) - 1. 

ReferenceBlackWhite 

Tag =532 (214.H) 
Type = RATIONAL 
N =6 
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Specifies a pair of headroom and footroom image data values (codes) for each 
pixel component. The first component code within a pair is associated with 
ReferenceBlack, and the second is associated with ReferenceWhite. The ordering 
of pairs is the same as those for pixel components of the Photometriclnterpretation 
type. ReferenceBlackWhite can be applied to images with a 
Photometriclnterpretation value of RGB or YCbCr. ReferenceBlackWhite is not 
used with other Photometriclnterpretation values. 

Computer graphics commonly places black and white at the extremities of the 
binary representation of image data; for example, black at code and white at 
code 255. In other disciplines, such as printing, film, and video, there are practical 
reasons to provide footroom codes below ReferenceBlack and headroom codes 
above ReferenceWhite. 

In film applications, they correspond to the densities Dmax and Dmin. In video 
applications, ReferenceBlack corresponds to 7.5 IRE and IRE in systems with 
and without setup respectively, and ReferenceWhite corresponds to 100 IRE 
units. 

Using YCbCr (See Section 21) and the CCIR Recommendation 601.1 video stan- 
dard as an example, code 16 represents ReferenceBlack, and code 235 represents 
ReferenceWhite for the luminance component (Y). For the chrominance compo- 
nents, Cb and Cr, code 128 represents ReferenceBlack, and code 240 represents 
ReferenceWhite. With Cb and Cr, the ReferenceWhite value is used to code 
reference blue and reference red respectively. 

The full range component value is converted from the code by: 

FullRangeValue = (code - ReferenceBlack) * CodingRange 
/ (ReferenceWhite - ReferenceBlack); 

The code is converted from the full-range component value by: 

code = (FullRangeValue * (ReferenceWhite - ReferenceBlack) 
/ CodingRange) + ReferenceBlack; 

For RGB images and the Y component of YCbCr images, CodingRange is de- 
fined as: 

CodingRange = 2 ** BitsPerSample - 1; 

For the Cb and Cr components of YCbCr images, CodingRange is defined as: 

CodingRange = 127; 

For RGB images, in the default special case of no headroom or footroom, this 
conversion can be skipped because the scaling multiplier equals 1.0 and the value 
equals the code. 

For YCbCr images, in the case of no headroom or footroom, the conversion for Y 
can be skipped because the value equals the code. For Cb and Cr, ReferenceBlack 
must still be subtracted from the code. In the general case, the scaling multiplica- 
tion for the Cb and Cr component codes can be factored into the YCbCr transform 
matrix. 

Useful ReferenceBlackWhite values for YCbCr images are: 

[0/1, 255/1,128/1, 255/1, 128/1, 255/1] 

no headroom/footroom 
[15/1, 235/1, 128/1, 240/1, 128/1, 240/1] 

CCIR Recommendation 601.1 headroom/footroom 
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Useful ReferenceBlackWhite values for BitsPerSample = 8,8,8 Class R images 
are: 

[0/1, 255/1,0/1, 255/1, 0/1, 255/1] 

no headroom/footroom 
[16/1, 235/1, 16/1, 235/1, 16/1, 235/1] 

CCIR Recommendation 601.1 headroom/footroom 
Default is [0/,NV/l, 0/1, NV/1, 0/1, NV/1] where NV = 2 ** BitsPerSample - 1. 
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Section 21 : YC b C r Images 



Introduction 



Digitizers of video sources that create RGB data are becoming more capable and 
less expensive. The RGB color space is adequate for this purpose. However, for 
both digital video and image compression applications a color difference color 
space is needed. The television industry depends on YC b C r for digital video. For 
image compression, subsampling the chrominance components allows for greater 
compression. TIFF YC b C r (which we shall call Class Y) supports these images and 
applications. 

Class Y is based on CCIR Recommendation 601-1, "Encoding Parameters of 
Digital Television for Studios." Class Y also has parameters that allow the de- 
scription of related standards such as CCIR Recommendation 709 and technologi- 
cal variations such as component-sample positioning. 

YC b C r is a distinct Photometriclnterpretation type. RGB pixels are converted to 
and from YC b C.for storage and display. 

Class Y defines the following fields: 

YC. C Coefficients transformation from RGB to YC C 

b r b r 

YC b C r SubSampling subsampling of the chrominance components 

YC b C r Positioning positioning of chrominance component samples relative 

to the luminance samples 

In addition, ReferenceBlackWhite, which specifies coding range expansion, is 
required by Class Y. See Section 20. 

Class Y YC b C images have three components: Y, the luminance component, and 
C b and C, two chrominance components. Class Y uses the international standard 
notation YC b C r for color-difference component coding. This is often incorrectly 
called YUV, which properly applies only to composite coding. 

The transformations between YC, C and RGB are linear transformations of 

D r 

uninterpreted RGB sample data, typically gamma-corrected values. The 
YC b C r Coefficients field describes the parameters of this transformation. 

Another feature of Class Y comes from subsampling the chrominance compo- 
nents. A Class Y image can be compressed by reducing the spatial resolution of 
chrominance components. This takes advantage of the relative insensitivity of the 
human visual system to chrominance detail. The YC b CSubSampling field de- 
scribes the degree of subsampling which has taken place. 

When a Class Y image is subsampled, each C b and C r sample is associated with a 
group of luminance samples. The YC b C Positioning field describes the position of 
the chrominance component samples relative to the group of luminance samples: 
centered or cosited. 

Class Y requires use of the ReferenceBlackWhite field. This field expands the 
coding range by describing the reference black and white values for the different 
components that allow headroom and footroom for digital video images. Since the 
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default for ReferenceBlackWhite is inappropriate for Class Y, it must be used 
explicitly. 

At first, it might seem that the information conveyed by Class Y and the RGB 
Colorimetry section is redundant. However, decoding YC b C r to RGB primaries 
requires the YC b C r fields, and interpretation of the resulting RGB primaries re- 
quires the colorimetry and transfer function information. See the RGB Colorim- 
etry section for details. 



Extensions to Existing Fields 



Class Y images use a distinct Photometriclnterpretation Field value: 

Photometriclnterpretation 

Tag = 262 (106.H) 

Type = SHORT 

N =1 

This Field indicates the color space of the image. The new value is: 

YCC 

b r 

A value of 6 indicates that the image data is in the YC b C r color space. TIFF uses 
the international standard notation YC b C r for color-difference sample coding. Y is 
the luminance component. C b and C r are the two chrominance components. RGB 
pixels are converted to and from YCC form for storage and display. 



Fields Defined in Class Y 



YC h C Coefficients 

b i 
Tag =529(211.H) 

Type = RATIONAL 

N =3 

The transformation from RGB to YCC image data. The transformation is speci- 
fied as three rational values that represent the coefficients used to compute lumi- 
nance, Y. 

The three rational coefficient values, LumaRed, LumaGreen and LumaBlue, axe 
the proportions of red, green, and blue respectively in luminance, Y. 

Y, C , and C may be computed from RGB using the luminance coefficients 
specified by this field as follows: 

Y = ( LumaRed * R + LumaGreen * G + LumaBlue * B ) 

C = ( B - Y ) / ( 2 - 2 * LumaBlue ) 
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C = ( R - Y ) / ( 2 - 2 * LumaRed ) 

R, G, and B may be computed from YC b C.as follows: 

R = C *(2 -2* LumaRed ) + Y 

G = ( Y - LumaBlue * B - LumaRed * R ) / LumaGreen 

B = C b * ( 2 - 2 * LumaBlue ) + Y 

In disciplines such as printing, film, and video, there are practical reasons to pro- 
vide footroom codes below the ReferenceBlack code and headroom codes above 
ReferenceWhite code. In such cases the values of the transformation matrix used 
to convert from YC b C to RGB must be multiplied by a scale factor to produce 
full-range RGB values. These scale factors depend on the reference ranges speci- 
fied by the ReferenceBlackWhite field. See the ReferenceBlackWhite and 
TransferFunction fields for more details. 

The values coded by this field will typically reflect the transformation specified 
by a standard for YC b C r encoding. The following table contains examples of com- 
monly used values. 

Standard LumaRed LumaGreen LumaBlue 



CCIR Recommendation 601-1 299/1000 587/1000 114/1000 

CCIR Recommendation 709 2125/10000 7154/10000 721/10000 



The default values for this field are those defined by CCIR Recommendation 601- 
1: 299/1000, 587/1000 and 114/1000, for LumaRed, LumaGreen and LumaBlue, 
respectively. 



YC b C r SubSampling 

Tag =530(212.11) 
Type = SHORT 

N =2 

Specifies the subsampling factors used for the chrominance components of a 

YC b C image. The two fields of this field, YC b CSubsampleHorizand 

YC b CSubsampleVert, specify the horizontal and vertical subsampling factors 

respectively. 

The two fields of this field are defined as follows: 

Short 0: YC C ' SubsampleHoriz'- 

1 = ImageWidth of this chroma image is equal to the ImageWidth of the associated 

luma image. 

2 = ImageWidth of this chroma image is halfthe ImageWidth of the associated luma 

image. 

4 = ImageWidth of this chroma image is one-quarter the ImageWidth of the associ- 
ated luma image. 

Short 1: YC b CSubsampleVert: 

1 = ImageLength (height) of this chroma image is equal to the ImageLength of the 
associated luma image. 
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2 = ImageLength (height) of this chroma image is half the ImageLength of the associ- 
ated luma image. 

4 = ImageLength (height) of this chroma image is one-quarter the ImageLength of the 
associated luma image. 

Both C b and C have the same subsampling ratio. Also, YC b C SubsampleVert shall 
always be less than or equal to YC t C SubsampleHoriz. 

ImageWidth and ImageLength are constrained to be integer multiples of 
YC C SubsampleHoriz and YC t C 'SubsampleVert respectively. TileWidth and 
TileLength have the same constraints. RowsPerStrip must be an integer multiple 
of YC b C SubsampleVert. 

The default values of this field are [ 2, 2 ]. 



YC b C Positioning 

Tag =531(213.H) 
Type = SHORT 

N =1 

Specifies the positioning of subsampled chrominance components relative to 
luminance samples. 

Specification of the spatial positioning of pixel samples relative to the other 
samples is necessary for proper image post processing and accurate image presen- 
tation. In Class Y files, the position of the subsampled chrominance components 
are defined with respect to the luminance component. Because components must 
be sampled orthogonally (along rows and columns), the spatial position of the 
samples in a given subsampled component may be determined by specifying the 
horizontal and vertical offsets of the first sample (i.e. the sample in the upper-left 
corner) with respect to the luminance component. The horizontal and vertical 
offsets of the first chrominance sample are denoted Xoffset[0,0] and Yoffset[0,0] 
respectively. Xoffset[0,0] and Yoffset[0,0] are defined in terms of the number of 
samples in the luminance component. 

The values for this field are defined as follows: 
Tag value YC bC r Positioning X and Y offsets of first chrominance sample 

1 centered Xoffset[0,0] = ChromaSubsampleHoriz 12- 0.5 

Yoffset[0,0] = ChromaSubsampleVert 12- 0.5 

2 cosited Xoffset[0,0] = 

Yoffset[0,0] = 



Field value 1 (centered) must be specified for compatibility with industry stan- 
dards such as PostScript Level 2 and QuickTime. Field value 2 (cosited) must be 
specified for compatibility with most digital video standards, such as CCIR Rec- 
ommendation 60 1 - 1 . 

As an example, for ChromaSubsampleHoriz = 4 and ChromaSubsampleVert = 2, 
the centers of the samples are positioned as illustrated below: 
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YCC Positioning = 1 

b r ° 

X X X X 

o 

X X X X 



YCC Positioning = 2 

b r ° 

(X) X X X 

X X X X 



X X X X 

o 

X X X X 



(X) X X X 



X X X X 



X Luminance samples 

( J Chrominance samples 



Proper subsampling of the chrominance components incorporates an anti-aliasing 
filter that reduces the spectral bandwidth of the full-resolution samples. The type 
of filter used for subsampling determines the value of the YC b C r Positioning field. 

For YC b C r Positioning = 1 (centered), subsampling of the chrominance compo- 
nents can easily be accomplished using a symmetrical digital filter with an even 
number of taps (coefficients). A commonly used filter for 2:1 subsampling utilizes 
two taps (1/2,1/2). 

For YCC Positioning = 2 (cosited), subsampling of the chrominance components 
can easily be accomplished using a symmetrical digital filter with an odd number 
of taps. A commonly used filter for 2: 1 subsampling utilizes three taps (1/4,1/2,1/4). 

The default value of this field is 1 . 



Ordering of Component Samples 



This section defines the ordering convention used for Y, C b , and C component 
samples when the PlanarConfiguration field value = 1 (interleaving). For 
PlanarConfiguration = 2, component samples are stored as 3 separate planes, and 
the ordering is the same as that used for other Photometric Interpretation field 
values. 



For PlanarConfiguration = 1, the component sample order is based on the subsam- 
pling factors, ChromaSubsampleHoriz and ChromaSubsampleVert, defined by 
the YC b CSubSampling field. The image data within a TIFF file is comprised of 
one or more "data units", where a data unit is defined to be a sequence of samples: 

• one or more Y samples 

• a C fc sample 

• a C sample 

The Y samples within a data unit are specified as a two-dimensional array having 
ChromaSubsampleVert rows of ChromaSubsampleHoriz samples. 
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Expanding on the example in the previous section, consider a YC b C r image having 
ChromaSubsampleHoriz = 4 and ChromaSubsampleVert = 2: 



Y component 



Cb component Cr component 



Y00 


Y01 


Y02 


Y03 


Y04 


Y05 






Y10 


Yll 


Y12 


Y13 











































CbOO 









CrOO 









For PlanarConfiguration = 1, the sample order is: 

YYYYYYYYCbCrYY 

00' 01' 02' 03' 10' 11' 12' * 13' 00' 00' 04' 05"" 

Minimum Requirements for YCbCr Images 

In addition to satisfying the general Baseline TIFF requirements, a YCbCr file 
must have the following characteristics: 

• SamplesPerPixel = 3. SHORT. Three components representing Y, Cb and Cr. 

• BitsPerSample = 8,8,8. SHORT. 

• Compression = none (1), LZW (5) or JPEG (6). SHORT. 

• Photometriclnterpretation = YC fc C r (6). SHORT. 

• ReferenceBlackWhite = 6 RATIONALS. Specify the reference values for 
black and white. 

If the conversion from RGB is not according to CCIR Recommendation 601-1, 
code YC C Coefficients. 
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Section 22: JPEG Compression 



Introduction 



Image compression reduces the storage requirements of pictorial data. In addition, 
it reduces the time required for access to, communication with, and display of 
images. To address the standardization of compression techniques an international 
standards group was formed: the Joint Photographic Experts Group (JPEG). JPEG 
has as its objective to create a joint ISO/CCITT standard for continuous tone 
image compression (color and grayscale). 

JPEG decided that because of the broad scope of the standard, no one algorithmic 
procedure was able to satisfy the requirements of all applications. It was decided 
to specify different algorithmic processes, where each process is targeted to sat- 
isfy the requirements of a class of applications. Thus, the JPEG standard became a 
"toolkit" whereby the particular algorithmic "tools" are selected according to the 
needs of the application environment. 

The algorithmic processes fall into two classes: lossy and lossless. Those based on 
the Discrete Cosine Transform (DCT) are lossy and typically provide for substan- 
tial compression without significant degradation of the reconstructed image with 
respect to the source image. 

The simplest DCT-based coding process is the baseline process. It provides a 
capability that is sufficient for most applications. There are additional DCT-based 
processes that extend the baseline process to a broader range of applications. 

The second class of coding processes is targeted for those applications requiring 
lossless compression. The lossless processes are not DCT-based and are utilized 
independently of any of the DCT-based processes. 

This Section describes the JPEG baseline, the JPEG lossless processes, and the 
extensions to TIFF defined to support JPEG compression. 



JPEG Baseline Process 



The baseline process is a DCT-based algorithm that compresses images having 8 
bits per component. The baseline process operates only in sequential mode. In 
sequential mode, the image is processed from left to right and top to bottom in a 
single pass by compressing the first row of data, followed by the second row, and 
continuing until the end of image is reached. Sequential operation has minimal 
buffering requirements and thus permits inexpensive implementations. 



The JPEG baseline process is an algorithm which inherently introduces error into 
the reconstructed image and cannot be utilized for lossless compression. The 
algorithm accepts as input only those images having 8 bits per component. Images 
with fewer than 8 bits per component may be compressed using the baseline pro- 
cess algorithm by left justifying each input component within a byte before com- 
pression. 
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Figure 1 . Baseline Process Encoder and Decoder 

A functional block diagram of the Baseline encoding and decoding processes is 
contained in Figure 1 . Encoder operation consists of dividing each component of 
the input image into 8x8 blocks, performing the two-dimensional DCT on each 
block, quantizing each DCT coefficient uniformly, subtracting the quantized DC 
coefficient from the corresponding term in the previous block, and then entropy 
coding the quantized coefficients using variable length codes (VLCs). Decoding 
is performed by inverting each of the encoder operations in the reverse order. 



The DCT 

Before performing the foward DCT, input pixels are level-shifted so that they 
range from -128 to +127. Blocks of 8x8 pixels are transformed with the two- 
dimensional 8x8 DCT: 



F(u, V ) = I 1 C(u)C(v)ELf(x,y) 



cos 



n(2x+l)u n(2y+l)v 



16 



cos- 



16 



and blocks are inverse transformed by the decoder with the Inverse DCT: 



f(x,y) = ^EEC(u)C(v) F(u,v) cos j^ 



K(2x+l)u n(2y+l)v 



cos- 



16 



with u, v,x,y= 0, 1, 2, ... 7 

where x, y = spatial coordinates in the pel domain 
m, v = coordinates in the transform domain 

C(u), C(v) = 1 / sqrt(2) for u, v = 



1 



otherwise 
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Although the exact method for computation of the DCT andlDCTis not subject 
to standardization and will not be specified by JPEG, it is probable that JPEG will 
adopt DCT-conformance specifications that designate the accuracy to which the 
DCT must be computed. The DCT-conformance specifications will assure that 
any two JPEG implementations will produce visually-similar reconstructed im- 
ages. 



Quantization 

The coefficients of the DCT are quantized to reduce their magnitude and increase 
the number of zero-value coefficients. The DCT coefficients are independently 
quantized by uniform quantizers. A uniform quantizer divides the real number 
line into steps of equal size, as shown in Figure 2. The quantization step-size 
applied to each coefficient is determined from the contents of a 64-element quan- 
tization table. 

C (u,v) 



■ F (u,v) 



-3 Q -2 Q -1 Q 1 Q 2 Q 3 Q 

O • 

o • 



Figure 2. Uniform Quantization 

The baseline process provides for up to 4 different quantization tables to be de- 
fined and assigned to separate interleaved components within a single scan of the 
input image. Although the values of each quantization table should ideally be 
determined through rigorous subjective testing which estimates the human 
psycho-visual thresholds for each DCT coefficient and for each color component 
of the input image, JPEG has developed quantization tables which work well for 
CCIR 601 resolution images and has published these in the informational section 
of the proposed standard. 



DC Prediction 

The DCT coefficient located in the upper-left corner of the transformed block 
represents the average spatial intensity of the block and is referred to as the "DC 
coefficient". After the DCT coefficients are quantized, but before they are entropy 
coded, DC prediction is performed. DC prediction simply means that the DC term 
of the previous block is subtracted from the DC term of the current block prior to 
encoding. 
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Zig-Zag Scan 

Prior to entropy coding, the DCT coefficients are ordered into a one-dimensional 
sequence according to a "zig-zag" scan. The DC coefficient is coded first, fol- 
lowed by AC coefficient coding, proceeding in the order illustrated in Figure 3. 




Figure 3. Zig-Zag Scan of DCT Coefficients 



Entropy Coding 

The quantized DCT coefficients are further compressed using entropy coding. 
The baseline process performs entropy coding using variable length codes (VLCs) 
and variable length integers (VLIs). 

VLCs, commonly known as Huffman codes, compress data symbols by creating 
shorter codes to represent frequently -occurring symbols and longer codes for 
occasionally-occurring symbols. One reason for using VLCs is that they are easily 
implemented by means of lookup tables. 

Separate code tables are provided for the coding of DC and AC coefficients. The 
following paragraphs describe the respective coding methods used for coding DC 
and AC coefficients. 



DC Coefficient Coding 

DC prediction produces a "differential DC coefficient" that is typically small in 
magnitude due to the high correlation of neighboring DC coefficients. Each dif- 
ferential DC coefficient is encoded by a VLC which represents the number of 
significant bits in the DC term followed by a VLI representing the value itself. 
The VLC is coded by first determining the number of significant bits, SSSS, in the 
differential DC coefficient through the following table: 



SSSS 



Differential DC Value 




1 

2 
3 
4 
5 





-1,1 

-3,-2, 2,3 

-7..-4,4..7 

-15. .-8, 8. .15 

-31. .-16, 16. .31 
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6 


-63..-32,32..63 


7 


-127..-64.64..127 


8 


-255. .-128, 128. .255 


9 


-511. .-256, 256. .511 


10 


-1023. .-512, 512.. 1023 


11 


-2047..-1024, 1024..2047 


12 


-4095. . -2048, 2048..4095 



SSSS is then coded from the selected DC VLC table. The VLC is followed by a 
VLI having SSSS bits that represents the value of the differential DC coefficient 
itself. If the coefficient is positive, the VLI is simply the low-order bits of the 
coefficient. If the coefficient is negative, then the VLI is the low-order bits of the 
coefficient- 1 . 



AC Coefficient Coding 

In a similar fashion, AC coefficients are coded with alternating VLC and VLI 
codes. The VLC table, however, is a two-dimensional table that is indexed by a 
composite 8-bit value. The lower 4 bits of the 8-bit value, i.e. the column index, is 
the number of significant bits, SSSS, of a non-zero AC coefficient. SSSS is com- 
puted through the same table as that used for coding the DC coefficient. The 
higher-order 4 bits, the row index, is the number of zero coefficients, NNNN, that 
precede the non-zero AC coefficient. The first column of the two-dimensional 
coding table contains codes that represent control functions. Figure 4 illustrates 
the general structure of the AC coding table. 



SSSS - Size of Non-Zero AC Coefs 
01 2... 10 11. ..15 



NNNN 

Length ' 
of 

Zero 
Run 



EOB 








ZRL 



Figure 4. 2-D Run-Size Value Array for AC Coefs 

The shaded portions are undefined in the baseline process 

The flow chart in Figure 5 specifies the AC coefficient coding procedure. AC 
coefficients are coded by traversing the block in the zig-zag sequence and count- 
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ing the number of zero coefficients until a non-zero AC coefficient is encountered. 
If the count of consecutive zero coefficients exceeds 15, then a ZRL code is coded 
and the zero run-length count is reset. When a non-zero AC coefficient is found, 
the number of significant bits in the non-zero coefficient, SSSS, is combined with 
the zero run-length that precedes the coefficient, NNNN, to form an index into the 
two-dimensional VLC table. The selected VLC is then coded. The VLC is fol- 
lowed by a VLI that represents the value of the AC coefficient. This process is 
repeated until the end of the block is reached. If the last AC coefficient is zero, 
then an End of Block (EOB) VLC is encoded. 



Start 



Code (ZRL) 
R = R-16 



K = 
R = 



K= K + 1 



Code R.Coef (K) 
R = 




R= R + 1 





Code (EOB) 



Done 



Figure 5. Encoding Procedure for AC Coefs 



JPEG Lossless Processes 



The JPEG lossless coding processes utilize a spatial-prediction algorithm based 
upon a two-dimensional Differential Pulse Code Modulation (DPCM) technique. 
They are compatible with a wider range of input pixel precision than the DCT- 
based algorithms (2 to 16 bits per component). Although the primary motivation 
for specifying a spatial algorithm is to provide a method for lossless compression, 
JPEG allows for quantization of the input data, resulting in lossy compression and 
higher compression rates. 



Although JPEG provides for use of either the Huffman or Arithmetic entropy- 
coding models by the processes for lossless coding, only the Huffman coding 
model is supported by this version of TIFF. The following is a brief overview of 
the lossless process with Huffman coding. 
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Control Structure 

Much of the control structure developed for the sequential DCT procedures is also 
used for sequential lossless coding. Either interleaved or non-interleaved data 
ordering may be used. 



Coding Model 

The coding model developed for coding the DC coefficients of the DCT is ex- 
tended to allow a number of one-dimensional and two-dimensional predictors for 
the lossless coding function. Each component uses an independent predictor. 



Prediction 

Figure 6 shows the relationship between the neighboring values used for predic- 
tion and the sample being coded. 



c 



B 



Figure 6. Relationship between sample and prediction samples 

Y is the sample to be coded and A, B, and C are the samples immediately to the 
left, immediately above, and diagonally to the left and above. 

The allowed predictors are listed in the following table. 

Selection-value Prediction 






no prediction (differential coding) 


1 


A 


2 


B 


3 


C 


4 


A+B-C 


5 


A+((B-C)/2) 


6 


B+((A-C)/2) 


7 


(A+B)/2 



Selection-value shall only be used for differential coding in the hierarchical 
mode. Selections 1 , 2 and 3 are one-dimensional predictors and selections 4, 5, 6, 
and 7 are two dimensional predictors. The divide by 2 in the prediction equations 
is done by a arithmetic -right-shift of the integer values. 
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The difference between the prediction value and the input is calculated modulo 
2**16. Therefore, the prediction can also be treated as a modulo 2**16 value. In 
the decoder the difference is decoded and added, modulo 2** 16, to the prediction. 



Huffman Coding of the Prediction Error 

The Huffman coding procedures defined for coding the DC coefficients are used 
to code the modulo 2**16 differences. The table for DC coding is extended to 17 
entries that allows for coding of the modulo 2**16 differences. 



Point Transformation Prior to Lossless Coding 

For the lossless processes only, the input image data may optionally be scaled 
(quantized) prior to coding by specifying a nonzero value in the point transforma- 
tion parameter. Point transformation is defined to be division by a power of 2. 

If the point transformation field is nonzero for a component, a point transforma- 
tion of the input is performed prior to the lossless coding. The input is divided by 
2**Pt, where Pt is the value of the point transform signaling field. The output of 
the decoder is rescaled to the input range by multiplying by 2**Pt. Note that the 
scaling of input and output can be performed by arithmetic shifts. 



Overview of the JPEG Extension to TIFF 



In extending the TIFF definition to include JPEG compressed data, it is necessary 
to note the following: 



• JPEG is effective only on continuous-tone color spaces: 

Grayscale (Photometric Interpretation =1) 

RGB (Photometric Interpretation = 2) 

CMYK (Photometric Interpretation = 5) (See the CMYK Images section.) 

YC b C r (Photometric Interpretation = 6) (See the YCbCr images section.) 

• Color conversion to YC b C is often used as part of the compression process 
because the chrominance components can be subsampled and compressed to a 
greater degree without significant visual loss of quality. Fields are defined to 
describe how this conversion has taken place and the degree of subsampling 
employed (see the YCbCr Images section). 

• New fields have been defined to specify the JPEG parameters used for com- 
pression and to allow quantization tables and Huffman code tables to be incor- 
porated into the TIFF file. 
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• TIFF is compatible with compressed image data that conforms to the syntax of 
the JPEG interchange format for compressed image data. Fields are defined 
that may be utilized to facilitate conversion from TIFF to interchange format. 

• The PlanarConfiguration Field is used to specify whether or not the com- 
pressed data is interleaved as defined by JPEG. For any of the JPEG DCT- 
based processes, the interleaved data units are coded 8x8 blocks rather than 
component samples. 

• Although JPEG codes consecutive image blocks in a single contiguous 
bitstream, it is extremely useful to employ the concept of tiles in an image. The 
TIFF Tiles section defines some new fields for tiles. These fields should be 
stored in place of the older fields for strips. The concept of tiling an image in 
both dimensions is important because JPEG hardware may be limited in the 
size of each block that is handled. 

• Note that the nomenclature used in the TIFF specification is different from the 
JPEG Draft International Standardittee Draft (ISO DIS 109 18-1) in some 
respects. The following terms should be equated when reading this Section: 

TIFF name JPEG DIS name 

ImageWidth Number of Pixels 

ImageLength Number of Lines 

SamplesPerPixel Number of Components 

JPEGQTable Quantization Table 

JPEGDCTable Huffman Table for DC coefficients 

JPEG ACTable Huffman Table for AC coefficients 



Strips and Tiles 



The JPEG extension to TIFF has been designed to be consistent with the existing 
TIFF strip and tile structures and to allow quick conversion to and from the 
stream-oriented compressed image format defined by JPEG. 



Compressed images conforming to the syntax of the JPEG interchange format can 
be converted to TIFF simply by defining a single strip or tile for the entire image 
and then concatenating the TIFF image description fields to the JPEG compressed 
image data. The strip or tile offset field points directly to the start of the entropy 
coded data (not to a JPEG marker). 

Multiple strips or tiles are supported in JPEG compressed images using restart 
markers. Restart markers, inserted periodically into the compressed image data, 
delineate image segments known as restart intervals. At the start of each restart 
interval, the coding state is reset to default values, allowing every restart interval 
to be decoded independently of previously decoded data. TIFF strip and tile off- 
sets shall always point to the start of a restart interval. Equivalently, each strip or 
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tile contains an integral number of restart intervals. Restart markers need not be 
present in a TIFF file; they are implicitly coded at the start of every strip or tile. 

To maximize interchangeability of TIFF files with other formats, a restriction is 
placed on tile height for files containing JPEG -compressed image data conform- 
ing to the JPEG interchange format syntax. The restriction, imposed only when 
the tile width is shorter than the image width and when the 
JPEGInterchangeFormat Field is present and non-zero, states that the tile height 
must be equal to the height of one JPEG Minimum Coded Unit (MCU). This 
restriction ensures that TIFF files may be converted to JPEG interchange format 
without undergoing decompression. 



Extensions to Existing Fields 



JPEG Fields 



Compression 

Tag =259(103.H) 
Type = SHORT 
N =1 

This Field indicates the type of compression used. The new value is: 
6= JPEG 



JPEGProc 

Tag =512(200.H) 
Type = SHORT 

N =1 

This Field indicates the JPEG process used to produce the compressed data. The 
values for this field are defined to be consistent with the numbering convention 
used in ISO DIS 109 1 8-2. Two values are defined at this time. 

1= Baseline sequential process 

14= Lossless process with Huffman coding 

When the lossless process with Huffman coding is selected by this Field, the 
Huffman tables used to encode the image are specified by the JPEGDCTables 
field, and the JPEG ACTables field is not used. 

Values indicating JPEG processes other than those specified above will be de- 
fined in the future. 
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Not all of the fields described in this section are relevant to the JPEG process 
selected by this Field. The following table specifies the fields that are applicable 
to each value defined by this Field. 



Tag Name 


JPEGProc=l 


JPEGProc=14 


JPEGInterchangeFormat 


X 


X 


JPEGInterchangeFormatLength 


X 


X 


JPEGRestart Interval 


X 


X 


JPEGLosslessPredictors 




X 


JPEGPointTransforms 




X 


JPEGQTables 


X 




JPEGDCTables 


X 


X 


JPEGACTables 


X 





This Field is mandatory whenever the Compression Field is JPEG (no default). 

JPEGInterchangeFormat 

Tag =513(201.H) 
Type = LONG 

N =1 

This Field indicates whether a JPEG interchange format bitstream is present in the 
TIFF file. If a JPEG interchange format bitstream is present, then this Field points 
to the Start of Image (SOI) marker code. 

If this Field is zero or not present, a JPEG interchange format bitstream is not 
present. 

JPEGInterchangeFormatLength 

Tag =514(202.H) 
Type = LONG 

N =1 

This Field indicates the length in bytes of the JPEG interchange format bitstream. 
This Field is useful for extracting the JPEG interchange format bitstream without 
parsing the bitstream. 

This Field is relevant only if the JPEGInterchangeFormat Field is present and is 
non-zero. 

JPEGRestartlnterval 

Tag = 5 15 (203. H) 
Type = SHORT 

N =1 



105 



TIFF 6.0 Specification Final — June 3, 1992 



This Field indicates the length of the restart interval used in the compressed image 
data. The restart interval is defined as the number of Minimum Coded Units 
(MCUs) between restart markers. 

Restart intervals are used in JPEG compressed images to provide support for 
multiple strips or tiles. At the start of each restart interval, the coding state is reset 
to default values, allowing every restart interval to be decoded independently of 
previously decoded data. TIFF strip and tile offsets shall always point to the start 
of a restart interval. Equivalently, each strip or tile contains an integral number of 
restart intervals. Restart markers need not be present in a TIFF file; they are im- 
plicitly coded at the start of every strip or tile. 

See the JPEG Draft International Standard (ISO DIS 10918-1) for more informa- 
tion about the restart interval and restart markers. 

If this Field is zero or is not present, the compressed data does not contain restart 
markers. 



JPEGLosslessPredictors 

Tag = 5 17 (205. H) 

Type = SHORT 

N = SamplesPerPixel 

This Field points to a list of lossless predictor-selection values, one per compo- 
nent. 
The allowed predictors are listed in the following table. 

Selection-value Prediction 



1 


A 


2 


B 


3 


C 


4 


A+B-C 


5 


A+((B-C)/2) 


6 


B+((A-C)/2) 


7 


(A+B)/2 



A, B, and C are the samples immediately to the left, immediately above, and 
diagonally to the left and above the sample to be coded, respectively. 

See the JPEG Draft International Standard (ISO DIS 109 18- 1) for more details. 

This Field is mandatory whenever the JPEGProc Field specifies one of the 
lossless processes (no default). 



JPEGPointTransforms 

Tag =518(206.H) 

Type = SHORT 

N = SamplesPerPixel 
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This Field points to a list of point transform values, one per component. This Field 
is relevant only for lossless processes. 

If the point transformation value is nonzero for a component, a point transforma- 
tion of the input is performed prior to the lossless coding. The input is divided by 
2**Pt, where Pt is the point transform value. The output of the decoder is rescaled 
to the input range by multiplying by 2**Pt. Note that the scaling of input and 
output can be performed by arithmetic shifts. 

See the JPEG Draft International Standard (ISO DIS 109 18- 1) for more details. 
The default value of this Field is for each component (no scaling). 



JPEGQTables 

Tag =519(207.H) 

Type = LONG 

N = SamplesPerPixel 

This Field points to a list of offsets to the quantization tables, one per component. 
Each table consists of 64 BYTES (one for each DCT coefficient in the 8x8 block). 
The quantization tables are stored in zigzag order. 

See the JPEG Draft International Standard (ISO DIS 109 18-1) for more details. 

It is strongly recommended that, within the TIFF file, each component be assigned 
separate tables. This Field is mandatory whenever the JPEGProc Field specifies a 
DCT-based process (no default). 

JPEGDCTables 

Tag = 520 (208.H) 

Type = LONG 

N = SamplesPerPixel 

This Field points to a list of offsets to the DC Huffman tables or the lossless 
Huffman tables, one per component. 

The format of each table is as follows: 

16 BYTES of "BITS", indicating the number of codes of lengths 1 to 16; 

Up to 17 BYTES of "VALUES", indicating the values associated with 
those codes, in order of length. 

See the JPEG Draft International Standard (ISO DIS 109 18-1) for more details. 

It is strongly recommended that, within the TIFF file, each component be assigned 
separate tables. This Field is mandatory for all JPEG processes (no default). 

JPEGACTables 

Tag =521(209.H) 

Type = LONG 

N = SamplesPerPixel 
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This Field points to a list of offsets to the Huffman AC tables, one per component. 
The format of each table is as follows: 

16 BYTES of "BITS", indicating the number of codes of lengths 1 to 16; 

Up to 256 BYTES of "VALUES", indicating the values associated with 
those codes, in order of length. 

See the JPEG Draft International Standard (ISO DIS 109 18-1) for more details. 

It is strongly recommended that, within the TIFF file, each component be assigned 
separate tables. This Field is mandatory whenever the JPEGProc Field specifies a 
DCT-based process (no default). 
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Minimum Requirements for TIFF with JPEG Compression 

The table on the following page shows the minimum requirements of a TIFF file 
that uses tiling and contains JPEG data compressed with the Baseline process. 



References 



Tag = NewSubFileType (254) 
Type = Long 
Length = 1 
Value = 


Single image 


Tag = ImageWidth (256) 
Type = Long 
Length = 1 
Value = ? 




Tag = Image Length (257) 
Type = Long 
Length = 1 
Value = ? 




Tag = B i t s P e r l: = T.p 1 - -25: 

Type = Short 

Length = SamplesPerPixel 


8 : Monochrome 
8,8,8 : RGB 
8,8,8 : YCbCr 
8,8,8,8 : CMYK 


Tag = Compressio- ■: 2 5 i ■ 
Type = Long 
Length = 1 
Value = 6 


'• : JPEG compression 


Tag = Photone-ricI~t-rpretcL-icn (262) 
Type = Short 
Length = 1 
Value = ? 


0,1 

2 

5 


: Monochrome 

RGB 

CMYK 

YCbCr 


Tag = SamplesPerPixel (277) 
Type = Short 
Length = 1 
Value = ? 


1 
3 
3 

-. 


Monochrome 
RGB 
YCbCr 
CMYK 


Tag = X?.esol-jt id- (282) 
Type = Rational 
Length = 1 
Value = ? 




Tag = YRe solution (283) 
Type = Rational 
Length = 1 
Value = ? 




Tag = Plana rConfigurat ion (284) 
Type = Short 
Length = 1 
Value = ? 


1 : Block Interleaved 

2 : Not interleaved 


Tag = ResolutionUnit (296) 
Type = Short 
Length = 1 
Value = ? 




Tag = TileWidth (322) 
Type = Short 
Length = 1 
Value = ? 


Multiple of 8 


Tag = TileLength (323) 
Type = Short 
Length = 1 
Value = ? 


Multiple of 8 


Tag = Tile Off set; :■!-. 
Type = Long 
Length = Number of tiles 
Value = ? 




Tag = TileByteCounts (325) 
Type = Long 
Length = Number of tiles 
Value = ? 




Tag = JPEGProc (512) 
Type = Short 
Length = 1 
Value = ? 




Tag = JPEGQTables (519) 

Type = Long 

Length = SamplesPerPixel 

Value = ? 


Offsets to tables 


Tag = JPEGDCTables (520) 

Type = Long 

Length = SamplesPerPixel 

Value = ? 


Offsets to tables 


Tag = JPEGACTables (521) 

Type = Long 

Length = SamplesPerPixel 

Value = ? 


Offsets to tables 



[1] Wallace, G., "Overview of the JPEG Still Picture Compression Algorithm", 
Electronic Imaging East '90. 

[2] ISO/IEC DIS 10918-1, "Digital Compression and Coding of Continuous-tone 
Still Images", Sept. 1991. 
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Section 23: CIE L*a*b* Images 



What is CIE L*a*b*7 

CIE La*b* is a color space that is colorimetric, has separate lightness and chroma 
channels, and is approximately perceptually uniform. It has excellent applicability 
for device-independent manipulation of continuous tone images. These attributes 
make it an excellent choice for many image editing functions. 

1976 CIEL*a*b* is represented as a Euclidean space with the following three 
quantities plotted along axes at right angles: L* representing lightness, a* repre- 
senting the red/green axis, and b* representing the yellow/blue axis. The formulas 
for 1976 CIE L*a*b* follow: 

L*=\ 16(y/F„) 1/3 -16 for Y/Y n > 0.008856 

L*=9033(Y/Y n ) for Y/Y n <= 0.008856 *see note below. 

a*=50o[(X9c„) 1/3 -(y/r„) 1/3 ] 

b*=W)[{Y/Y n ) m -{Ztt n ) m \ 

where X ,Y , and Z are the CIE X, Y, and Z tristimulus values of an appropriate 
reference white. Also, if any of the ratios X/X t , Y/Y ' , or Z/Z is equal to or less than 
0.008856, it is replaced in the formulas with 

7.787F+ 16/116, 

where F is XJX t , Y/Y , or Z/Z , as appropriate (note: these low-light conditions are 
of no relevance for most document-imaging applications). Tiff is defined such 
that each quantity be encoded with 8 bits. This provides 256 levels of L* lightness; 
256 levels (+/- 127) of a* and 256 levels (+/- 127) of/?*. Dividing the 0-100 
range of L* into 256 levels provides lightness steps that are less than half the size 
of a "just noticeable difference". This eliminates banding, even under conditions 
of substantial tonal manipulation. Limiting the theoretically unbounded a* and b* 
ranges to +/- 1 27 allows encoding in 8 bits without eliminating any but the most 
saturated self-luminous colors. It is anticipated that the rare specialized applica- 
tions requiring support of these extreme cases would be unlikely to use CIELAB 
anyway. All object colors, in fact all colors within the theoretical MacAdam lim- 
its, fall within the +/- 127 a */b* range. 
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The TIFF CIELAB Fields 



Photometriclnterpretation 

Tag =262(106.H) 
Type = SHORT 

N =1 

8= 1976CIEL*a*fc* 



Usage of other Fields. 

BitsPerSample: 8 

SamplesPerPixel - ExtraSamples: 3 for L*a*b*, 1 implies L* only, for mono- 
chrome data. 

Compression: same as other multi-bit formats. JPEG compression applies. 

PlanarConfiguration: both chunky and planar data could be supported. 

WhitePoint: does not apply 

PrimaryChromaticities: does not apply. 

TransferFunction: does not apply 

Alpha Channel information will follow the lead of other data types. 

The reference white for this data type is the perfect reflecting diffuser ( 100% 
diffuse reflectance at all visible wavelengths). The L* range is from (perfect 
absorbing black) to 100 (perfect reflecting diffuse white). The a* and b* ranges 
will be represented as signed 8 bit values having the range - 1 27 to + 1 27 . 

Converting between RGB and CIELAB, a Caveat 

The above CIELAB formulae are derived from CIEXYZ. Converting from 
CIELAB to RGB requires an additional set of formulae for converting between 
RGB and XYZ. For standard NTSC primaries these are: 

0.60700.17400.2000 R X 

0.29900.58700.1140 * G = Y 

0.00000.06601.1110 B Z 

Generally, D65 illumination is used and a perfect reflecting diffuser is used for the 
reference white. 

Since CIELAB is not a directly displayable format, some conversion to RGB will 
be required. While look-up table accelerated CIELAB to RGB conversion is 
certainly possible and fast, TIFF writers may choose to include a low resolution 
RGB subfile as an integral part of TIFF CIELAB. 
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Color Difference Measurements in CIELAB 



The differences between two colors in L*, a*, and b* are denoted by DL*, Da*, 
and Db*, respectively, with the total (3-dimensional) color difference represented 



A£* ab = [(A£*) 2 +(Aa*) 2 +(A/7*) 2 ] 1/2 . 

This color difference can also be expressed in terms of L* C* and a measure of 
hue. In this case, h , is not used because it is an angular measure and cannot be 

ab c 

combined with L* and C* directly. A linear-distance form of hue is used instead: 

CIE 1976 a,b hue-difference, AH*^ 

AH* ab = [(AE*) 2 -(AL*) 2 -(AC*) 2 ] 1/2 . 

where DC* is the chroma difference between the two colors. The total color dif- 
ference expression using this hue-difference is: 



A£* ab = [(AL*) 2 +(A/f *) 2 +(Afc*) 2 ] 



1/2 



It is important to remember that color difference is 3-dimensional: much more can 
be learned from a DL*a*b* triplet than from a single DE value. The DL*C*H* 
form is often the most useful since it gives the error information in a form that has 
more familiar perception correlates. Caution is in order, however, when using 
DH* for large hue differences since it is a straight-line approximation of a curved 
hue distance. 



The Merits of CIELAB 



Colo rim etric. 

First and foremost, CIELAB is colorimetric. It is traceable to the internationally- 
recognized standard CIE 1931 Standard Observer. This insures that it encodes 
color in a manner that is accurately modeled after the human vision system. Col- 
ors seen as matching are encoded identically, and colors seen as not matching are 
encoded differently. CIELAB provides an unambiguous definition of color with- 
out the necessity of additional information such as with RGB (primary 
chromaticities, white point, and gamma curves). 



Device Independent. 



Unlike RGB spaces which associate closely with physical phosphor colors, 
CIELAB contains no device association. CIELAB is not tailored for one device or 
device type at the expense of all others. 
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Full Color Gamut. 

Any one image or imaging device usually encounters a very limited subset of the 
entire range of humanly-perceptible color. Collectively, however, these images 
and devices span a much larger gamut of color. A truly versatile exchange color 
space should encompass all of these colors, ideally providing support for all vis- 
ible color. RGB, Photo YCC, YCbCr, and other display spaces suffer from gamut 
limitations that exclude significant regions of easily printable colors. CIELAB is 
defined for all visible color. 



Efficiency 

A good exchange space will maximize accuracy of translations between itself and 
other spaces. It will represent colors compactly for a given accuracy. These at- 
tributes are provided through visual uniformity. One of the greatest disadvantages 
of the classic CIE system (and RGB systems as well) is that colors within it are not 
equally spaced visually. Encoding full-color images in a linear-intensity space, 
such as the typical RGB space or, especially, the XYZ space, requires a very large 
range (greater than 8-bits/primary) to eliminate banding artifacts. Adopting a non- 
linear RGB space improves the efficiency but not nearly to the extent as with a 
perceptually uniform space where these problems are nearly eliminated. A uni- 
form space is also more efficiently compressed (see below). 



Public Domain / Single Standard 

CIELAB maintains no preferential attachments to any private organization. Its 
existence as a single standard leaves no room for ambiguity. Since 1976, CIELAB 
has continually gained popularity as a widely -accepted and heavily-used standard. 



Luminance/Chrominance Separation. 

The advantages for image size compression made possible by having a separate 
lightness or luminance channel are immense. Many such spaces exist. The degree 
to which the luminance information is fully-isolated into a single channel is an 
important consideration. Recent studies (Kasson and Plouffe of IBM) support 
CIELAB as a leading candidate placing it above CIELUV, YIQ, YUV, YCC, and 
XYZ. 

Other advantages support a separate lightness or luminance channel. Tone and 
contrast editing and detail enhancement are most easily accomplished with such a 
channel. Conversion to a black and white representation is also easiest with this 
type of space. 

When the chrominance channels are encoded as opponents as with CIELAB, 
there are other compression, image manipulation, and white point handling ad- 
vantages. 
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Compressibility (Data). 

Opponent spaces such as CIELAB are inherently more compressible than 
tristimulus spaces such as RGB. The chroma content of an image can be com- 
pressed to a greater extent, without objectionable loss, than can the lightness con- 
tent. The opponent arrangement of CIELAB allows for spatial subsampling and 
efficent compression using JPEG. 



Compressibility (Gamut). 

Adjusting the color range of an image to match the capabilities of the intended 
output device is a critical function within computational color reproduction. Lu- 
minance/chrominance separation, especially when provided in a polar form, is 
desirable for facilitating gamut compression. Accurate gamut compression in a 
tri-linear color space is difficult. 

CIELAB has a polar form (metric hue angle, and metric chroma, described be- 
low) that serves compression needs fairly well. Because CIELAB is not perfectly 
uniform, problems can arise when compressing along constant hue lines. Notice- 
able hue errors are sometimes introduced. This problem is no less severe with 
other contending color spaces. 

This polar form also provides advantages for local color editing of images. The 
polar form is not proposed as part of the TIFF addition. 



Getting the Most from CIELAB 



Image Editors 

The advantages of image editing within a perceptually uniform polar color space 
are tremendous. A detailed description of these advantages is beyond the scope of 
this section. As previously mentioned, many common tonal manipulation tasks 
are most efficiently performed when only a single channel is affected. Edge en- 
hancement, contrast adjustment, and general tone-curve manipulation all ideally 
affect only the lightness component of an image. 

A perceptual polar space works excellently for specifying a color range for mask- 
ing purposes. For example, a red shirt can be quickly changed to a green shirt 
without drawing an outline mask. The operation can be performed with a loosely, 
quickly -drawn mask region combined with a hue (and perhaps chroma) range that 
encompasses the shirt's colors. The hue component of the shirt can then bead- 
justed, leaving the lightness and chroma detail in place. 

Color cast adjustment is easily realized by shifting either or both of the chroma 
channels over the entire image or blending them over the region of interest. 



Converting from CIELAB to a device specific space 

For fast conversion to an RGB display, CIELAB can be decoded using 3x3 
matrixing followed by gamma correction. The computational complexity required 
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for accurate CRT display is the same with CIELAB as with extended luminance- 
chrominance spaces. 



Converting CIELAB for accurate printing on CMYK devices requires computa- 
tional complexity no greater than with accurate conversion from any other colori- 
metric space. Gamut compression becomes one of the more significant tasks for 
any such conversion. 
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Part 3: Appendices 



Part 3 contains additional information that is not part of the TIFF specification, 
but may be of use to developers. 
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Appendix A: TIFF Tags Sorted by Number 



TagName 


Decimal 


Hex 


Type 


Number of values 


NewSubfileType 


254 


FE 


LONG 


1 


SubfileType 


255 


FF 


SHORT 


1 


ImageWidth 


256 


100 


SHORT or LONG 


1 


ImageLength 


257 


101 


SHORT or LONG 


1 


BitsPerSample 


258 


102 


SHORT 


SamplesPerPixel 


Compression 


259 


103 


SHORT 


1 


Uncompressed 


1 








CCITT ID 


2 








Group 3 Fax 


3 








Group 4 Fax 


4 








LZW 


5 








JPEG 


6 








PackBits 


32773 








Photometriclnterpretation 


262 


106 


SHORT 


1 


WhitelsZero 











BlacklsZero 


1 








RGB 


2 








RGB Palette 


3 








Transparency mask 


4 








CMYK 


5 








YCbCr 


6 








CIELab 


8 








Threshholding 


263 


107 


SHORT 


1 


CellWidth 


264 


108 


SHORT 


1 


CellLength 


265 


109 


SHORT 


1 


FillOrder 


266 


10A 


SHORT 


1 


DocumentName 


269 


10D 


ASCII 




ImageDescription 


270 


10E 


ASCII 




Make 


271 


10F 


ASCII 




Model 


272 


110 


ASCII 




StripOffsets 


273 


111 


SHORT or LONG 


StripsPerlmage 


Orientation 


274 


112 


SHORT 


1 


SamplesPerPixel 


277 


115 


SHORT 


1 


RowsPerStrip 


278 


116 


SHORT or LONG 


1 


StripByteCounts 


279 


117 


LONG or SHORT 


StripsPerlmage 


MinSampleValue 


280 


118 


SHORT 


SamplesPerPixel 


MaxSampleValue 


281 


119 


SHORT 


SamplesPerPixel 


XResolution 


282 


11A 


RATIONAL 


1 


YResolution 


283 


11B 


RATIONAL 


1 


PlanarConfiguration 


284 


11C 


SHORT 


1 


PageName 


285 


11D 


ASCII 




XPosition 


286 


he 


RATIONAL 




YPosition 


287 


11F 


RATIONAL 




FreeOffsets 


288 


120 


LONG 




FreeByteCounts 


289 


121 


LONG 




GrayResponseUnit 


290 


122 


SHORT 


1 
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GrayResponseCurve 


291 


T40ptions 


292 


T60ptions 


293 


ResolutionUnit 


296 


PageNumber 


297 


TransferFunction 


301 


Software 


305 


DateTime 


306 


Artist 


315 


HostComputer 


316 


Predictor 


317 


WhitePoint 


318 


PrimaryChromaticities 


319 


ColorMap 


320 


HalftoneHints 


321 


TileWidth 


322 


TileLength 


323 


TileOffsets 


324 


TileByteCounts 


325 


InkSet 


332 


InkNames 


333 


NumberOflnks 


334 


DotRange 


336 


TargetPrinter 


337 


ExtraSamples 


338 


SampleFormat 


339 


SMinSampleValue 


340 


SMaxSampleValue 


341 


TransferRange 


342 


JPEGProc 


512 


JPEGInterchangeFormat 


513 


JPEGInterchangeFormatLngth 


514 


JPEGRestartlnterval 


515 


JPEGLosslessPredictors 


517 


JPEGPointTransforms 


518 


JPEGQTables 


519 


JPEGDCTables 


520 


JPEGACTables 


521 


YCbCrCoefficients 


529 


YCbCrSubSampling 


530 


YCbCrPositioning 


531 


Referenc eB lack White 


532 


Copyright 


33432 



123 


SHORT 


2**BitsPerSample 


124 


LONG 


1 


125 


LONG 


1 


128 


SHORT 


1 


129 


SHORT 


2 


12D 


SHORT 


{lor 

SamplesPerPixel}* 
2** BitsPerSample 


131 


ASCII 




132 


ASCII 


20 


13B 


ASCII 




13C 


ASCII 




13D 


SHORT 


1 


13E 


RATIONAL 


2 


13F 


RATIONAL 


6 


140 


SHORT 


3 * (2**BitsPerSample 


141 


SHORT 


2 


142 


SHORT or LONG 


1 


143 


SHORT or LONG 


1 


144 


LONG 


TilesPerlmage 


145 


SHORT or LONG 


TilesPerlmage 


14C 


SHORT 


1 


14D 


ASCII 


total number of charac 
ters in all ink name 
strings, including zeros 


14E 


SHORT 


1 


150 


BYTE or SHORT 


2, or 2* 
NumberOflnks 


151 


ASCII 


any 


152 


BYTE 


number of extra compo 
nents per pixel 


153 


SHORT 


SamplesPerPixel 


154 


Any 


SamplesPerPixel 


155 


Any 


SamplesPerPixel 


156 


SHORT 


6 


200 


SHORT 


1 


201 


LONG 


1 


202 


LONG 


1 


203 


SHORT 


1 


205 


SHORT 


SamplesPerPixel 


206 


SHORT 


SamplesPerPixel 


207 


LONG 


SamplesPerPixel 


208 


LONG 


SamplesPerPixel 


209 


LONG 


SamplesPerPixel 


211 


RATIONAL 


3 


212 


SHORT 


2 


213 


SHORT 


1 


214 


LONG 


2*SamplesPerPixel 


8298 


ASCII 


Any 



118 



TIFF 6.0 Specification Final — June 3, 1992 

Appendix B: Operating System 
Considerations 

Extensions and Filetypes 

The recommended MS-DOS, UNIX, and OS/2 file extension for TIFF files is 
".TIF". 

On an Apple Macintosh computer, the recommended Filetype is "TIFF". It is a 
good idea to also name TIFF files with a ".TIF" extension so that they can easily 
imported if transferred to a different operating system. 
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Index 



Symbols 




CCITT 49 


II 13 






JPEG 95 


image 28 


42 13 




LZW 57 


image file directory 13, 1 


A 




Modified Huffman 43 


image file header 13 




PackBits 42 


ImageDescription 34 


Adobe Developer Su| 


oport 8 


Copyright 31 


ImageLength 18, 27, 34 


alpha data 31 




Count 14, 15, 16 


ImageWidth 18, 27, 34 


associated 77 




D 


InkNames 70 


ANSI IT8 71 




InkSet 70 


Appendices 116 




DateTime 31 


J 


Artist 28 




default values 28 


ASCII 15 




Differencing Predictor 64 


JPEG compression 95 


B 




DocumentName 55 


baseline 95 




DotRange 71 


discrete cosine trans- 


Baseline TIFF 11 




DOUBLE 16 


form 95 


big-endian 13 




Duff, Tom 79 


entropy coding 98 


BitsPerSample 22, 


29 


E 


lossless processes 1 


BlacklsZero 17, 37 




quantization 97 


BYTE data type 15 




ExtraSamples 31 , 77 


JPEGACTables 107 



14 



CCITT 17, 30, 49 
CellLength 29 
CellWidth 29 
chunky format 38 
CIELAB images 110 
clarifications 6 
Class B 21 
Class G 22 
Class P 23 
Class R 25 
Classes 7 
CMYK Images 69 
ColorMap 23, 29 
ColorResponseCurves. See 

TransferFunction 
Compatibility 7 
compliance 12 
component 28 
compositing. See alpha data: 

associated 
compression 17, 30 



Facsimile 49 
file extension 119 
filetype 1 1 9 
FillOrder 32 
FLOAT 1 6 
FreeByteCounts 33 
FreeOffsets 33 



GrayResponseCurve 33, 73, 85 
GrayResponseUnit 33 
Group 3 17, 30 
Group30ptions 51 
Group40ptions 52 

H 

HalftoneHints 72 
Hexadecimal 12 
high fidelity color 69 
HostComputer 34 

I 

IFD. See image file directory 



100 



JPEGDCTables 107 
JPEGInterchangeFormat 105 
JPEGInterchangeFormatLength 105 
JPEGLosslessPredictors 106 
JPEGPointTransforms 106 
JPEGProc 104 
JPEGQTables 107 
JPEGRestartlnterval 105 

K 

no entries 

L 

little-endian 13 
LONG data type 15 
LZW compression 57 

M 

Make 35 

matting. See alpha data: associ- 
ated 

MaxComponentValue 35 

MaxSampleValue. See 
MaxComponentValue 

MinComponentValue 35 

MinSampleValue. See 
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MinComponentValue 


separated images 66 


MM 13 


SHORT data type 15 


Model 35 


SLONG 16 


Modified Huffman compres- 


Software 39 


sion 17, 30, 43 


SRATIONAL 16 


multi-page TIFF files 36 


SSHORT 16 


multiple strips 39 


StripByteCounts 19, 27, <■ 


N 


StripOffsets 19, 27, 40 


StripsPerlmage 39 


NewSubfileType 36 


subfile 16 


NumberOflnks 70 


SubfileType 40. See also 




NewSubfileType 





T 


Offset 1 5 




Orientation 36 


T40ptions 51 




T60ptions 52 


P 


tag 14 




TargetPrinter 71 


PackBits compression 42 


Threshholding 41 


PageName 55 


TIFF 


PageNumber 55 


administration 8 


palette color 23, 29, 37 


Baseline 1 1 


Photometriclnterpretation 17, 32, 37 


Class P 23 


pixel 28 


Class R 24 


planar format 38 


Classes 17 


PlanarConfiguration 38 


consulting 8 


Porter, Thomas 79 


extensions 48 


Predictor 64 


history 4 


PrimaryChromaticities 83 


other extensions 9 


private tags 8 


sample Files 20 


proposals 


scope 4 


submitting 9 


structure 13 


Q 


tags - sorted 1 1 7 


TIFF Advisory Committee 


no entries 


TileByteCounts 68 




TileLength 67 


R 


TileOffsets 68 




Tiles 66 


RATIONAL data type 15 


TilesPerlmage 67, 68 


reduced resolution 36 


C3 ' 

TileWidth 67 


ReferenceBlackWhite 86 


TransferFunction 84 


ResolutionUnit 18, 27, 38 


TransferRange 86 


revision notes 4 
RGB images 37 


transparency mask 36, 31 
type of a field 14 


row interleave 38 




RowsPerStrip 19, 27, 39, 68 


U 


s 


UNDEFINED 16 


sample. See component 


V 


SampleFormat 80 




SamplesPerPixel 39 


no entries 


SBYTE 16 


W 



40 



WhitelsZero 17, 37 
WhitePoint 83 



XPosition 55 
XResolution 19, 27, 41 



YCbCr images 87, 89 
YCbCrCoefficients 90 
YCbCrPositioning 92 
YCbCrSubSampling 91 
YPosition 56 
YResolution 19, 41 



no entries 
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